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PREFACE 


The present volume contains the more advanced parts of the 
differential and integral calculus, dealing mainly with functions 
of several variables. As in Volume IJ, I have sought to make 
definitions and methods follow naturally from intuitive ideas 
and to emphasize their physical interpretations—aims which 
are not at all incompatible with rigour. 

I would impress on readers. new to the subject, even more 
than I did in the preface to Volume I, that they are not expected 
to read a book like this consecutively. Those who wish to get a 
rapid grip of the most essential matters ‘should begin with 
Chapter II, and next pass on to Chapter IV; only then should 
they fill in the gaps by reading Chapter III and the appendices 
to the various chapters. It is by no means necessary that they 
should study Chapter I systematically in advance. 

The English edition differs from the German in many details, 
and contains a'good deal of additional matter. In particular, 
the chapter on differential equations has been greatly extended. 
Chapters on the calculus of variations and on functions of a 
complex variable have been added, as well as a supplement on 
real numbers. 

I have again to express my very cordial thanks to my German 
publisher, Julius Springer, for his generous attitude in con- 
senting to the publication of the English edition. I have also 
to thank Blackie & Son, Ltd., and their staff, especially Miss 
W. M. Deans, for co-operating with me and my assistants and 
relieving me of a considerable amount of proof reading. Finally, 
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. I must express my gratitude to the friends and colleagues 
who have assisted me in preparing the manuscript for the press, 
reading the proofs, and collecting the examples; in the first place 
to Dr. Fritz John, now of the University of Kentucky, and to 
Miss Margaret Kennedy, Newnham College, Cambridge, and also 
to Dr. Schénberg, Swarthmore College, Swarthmore, Pa. 


R. COURANT. 


New Rocuetrz, New York. 
March, 1936. 
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| _ CHAPTER I | 
Preliminary Remarks on Analytical 


Geometry and Vector Analysis — 

’ / . i ! ; A 

In the interpretation and application of the mathematical facts which 
form the main subject of this second volume it is often convenient to use 
the simple fundamental concepts of analytical geometry and vector 
analysis. Henoe, even though many readers will already have a certain 


knowledge of thesesubjects, it seems advisable to summarize their elements - 


in ® brief introductory chapter. This 

chapter, however, need not be studied . - 
before the rest, of the book is read; the ~ 
reader is advieed to refer to the facts. 
collected here only when he finds the Bags 
need of them in studying the later’parts he ene nap 
of the book.  — 


rin 


1; RecranautaR Co-oRDINATES 
AND VECTORS 

- To fix a point in a plane or in space, 
as is well known, we generally make use 
- of @ rectangular oo-ordinate system. In 
the plane we take two perpendicular ‘7 2 
 Tines, the z-axis and the y-axis; in space Fig. 1-—Co-ordinate axes in space 
we take three mutually perpendicular 
lines, the z-axia, thé. y-axis, and the z-axis. Taking the same unit of 


length’ on each axia, we assign to each point of the plane an 2-0o-ordinate 


ee ee 


and a y-co-ordinate in the usual way, or to each point in space an 
w:c0-ordinate, # y-co-ordinate, and a z-co-ordinate (fig. 1). Conversely, 
to every set of values (x, y) or (2; y, 2) there corresponds just one point . 
of the plane, or of space, as the case may be; a point is completely | 


Using the theorem of Pythagoras we find that the distarice between two 
_ points (xy ys) and (xy, y,) is given by 7 
a r= (a= BP + a — oP 


_ (e912) 


we kip gabe. Ss. 


2 ANALYTICAL GEOMETRY AND VECTORS [Cuap. 


while the distance between the points with co-ordinates (2), y,, 2,) and 
(as Yor 22) is d 


ra V (ae, — 2)? + (Yi — Ye)? + (a — &) 


In setting up a system of rectangular axes we must pay attention to 
the orientation of the co-ordinate system. 
In Vol. I, Chap. V, § 2 (p. 268) we distinguished between positive and 


Fig. 2.—Right-~handed system of axes Fig. 3.— Left-handed system of axes 


negative senses of rotation in the plane. The rotation through 90° which 
brings the positive x-axis of a plane co-ordinate system into the position of 
the positive y-axis in the shortest way defines a sense of rotation. According 
as this sense of rotation is positive or negative, we say that the system of 
axes is right-handed or left-handed. (cf. figs. 2 and 3). It is impossible to 
change a right-handed system into a left-handed system by a rigid motion 
~ confined to the plane. A similar distinction occurs with co-ordinate systems 


| , 


Yy a x 
x ¥ 
Fig. 4.—Right-handed screw Fig. 5.— Left-handed screw 

in space. For if one imagines oneself standing on the zy-plane with one’s 
head in the direction of the positive z-axis, it is possible to distinguish 
two types of do-ordinate system by means of the apparent orientation of 
the co-ordinate system in the zy-plane. If this system is right-handed the 
system in space is also said to be right-handed, otherwise left-handed 
. (of. figs. 4 and 5). A right-handed system corresponds to an ordinary right- _ 
handed screw; for if we make the zy-plane rotate about the z-axis (in the — 
sense prescribed by its orientation) and simultaneously give it a motion 
of translation along the positive z-axis, the combined motion is obviously 
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that of a right-handed screw. Similarly, a left-handed system corresponds 
to a left-handed screw. No rigid motion in three dimensions can transform 
a left-handed system into a right-handed system. 

In what follows we shall always use right-handed systems of axes. 

We may also assign an orientation to a system of three arbitrary axes 
passing through one point, provided these axes do not all lie in one plane, 
just as we have done here for a system of rectangular axes. 


2. Directions and Vectors. Formulse for Transforming Axes. 


An oriented line 7 in space or in a plane, that is, a line traversed in a 
definite sense, represents a direction; every oriented line that can be made 
to coincide with the line 7 in position 
and sense by displacement parallel to y 
; are 1 
itself represents the same direction. It 
is customary to specify a direction rela- 
tive to a co-ordinate system by drawing 


an oriented half-line in the given direc- > 
tion, starting from the origin of the 25 \ 


co-ordinate system, and on this half- % 
line taking the point with co-ordinates Fig. 6.—The angles Auieer lea 
(«, 8, y) which is at unit distance from ine 


the origin. The numbers «, 8, y are 
called the direction cosines of the direction. They are the cosines of the 
three angles 8,, 8,, 8; which the oriented line / makes with the positive 
x-axis, y-axis, and z-axis* (of. fig. 6); by the distance formula, they 
satisfy the relation 

ot + Bi+ y2= 1. 


Tf we restrict ourselves to the zy-plane, a direction can be specified by 
the angles §,, 8, which the oriented line J having this direction and 
passing through the origin forms with the positive z-axis and y-axis; or 
by the direction cosines « = cos8,, 8 = cos3,, which satisfy the equation 


a? + B= 1. 


A line-segment of given length and given direction we shall call a 
vector; more specifically, a bound vector if the initial point is fixed in space, 
and a free vector if the position of the initial point is immaterial. In the 
following pages, and indeed throughout most of the book, we shall omit 
the adjectives free and bound, and if nothing is said to the contrary we 
shall always take the vectors to be free vectors. We denote vectors by 
heavy type, e.g. a, 6, c, x, A. Two free vectors are said to be equal if 
one of them can be made to coincide with the other by displacement 
parallel to itself. We sometimes call the length of a vector its absolute 
value and denote it by | @ |. 


*The angle which one oriented line forms with another may always be 
taken as being between 0 and a, for in what follows only the cosines of such 
angles will be considered. 
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Tf from the initial and final points of a vector wv we drop perpen- 
diculars on an oriented line J, we obtain an oriented segment on / corre- 
sponding to the vector. If the orientation of this segment is the same as 
that of 1, we call its length the component of v in the direction of 1; if the 
orientations are opposite, we call the negative 
of the length of the segment the component of v 
in the direction of l. The component of v in the 


(J 
‘ 
\ 
v ' direction of 1 we denote by v;,. If 3 is the angle 

‘ between the direction of v and that of J (cf. 
' fig. 7), we always have 

|v jcosd—vi l v, = | v | cosd. 

Bigeye rolection ot awackor A vector wv of length 1 is called a unit vector. 


Its component in a direction / is equal to the 
cosine of the angle between / and v. The components of a vector v in the 
directions of the three axes of a co-ordinate system are denoted by 
Vy) Vg, Vs. If we transfer the initial point of v to the origin, we see that 


| o |= Vo2 + v,? + 9,7. 
If a, B, y are the direction cosines of the direction of wv, then 
%=| |e, »=[2|8, =| 2 ly. 


A free vector is completely determined by its components 0, U2, U9. 
An equation 
v= Ww 


between two vectors is therefore equivalent to the three ordinary equations 


y= Wy, 
VU, = We, 
Us = Wy 


There are two different reasons why the use of vectors is natural and 


a+(b+c)=(atb)+e 


Fig. 8.—Commutative law of vector Fig. 9.—Associative law of vector 
addition addition 


advantageous. Firstly, many geometrical concepts, and a still greater 
number of physical concepts, such as force, velocity, acceleration, &c., 
immediately reveal themselves as vectors independent of the particular 
co-ordinate system. Secondly, we can set up simple rules for calculating 
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with vectors analogous to the rules for calculating with ordinary numbers; 
by means of these many arguments can be developed in a simple way, 
independently of the particular co-ordinate system chosen. 

We begin by defining the sum of the two vectors @ and 5. For this 
purpose we displace the vector 6 parallel to itself until its initial point 
coincides with the final point of a. Then the initial point of @ and the 
final point of 4 determine a new vector c (see fig. 8) whose initial point 
is the initial point of a and whose final point is the final point of &. We | 
call c the sum of @ and 6 and write 


at+t+6é=cea 


For this additive process the commutative law 
a+b=ob+¢a4 


and the associative law 
a+(6+c)=(a+ 6)4+c=a+b+e 


obviously hold, as a glance at figs. 8 and 9 shows. 

From the definition of vector addition we at once obtain the “ projec- 
tion theorem ”: the component of the sum of two or more vectors in a direction 
1 is equal to the sum of the components of the individual vectors in that direc- 
tion, that is, 

(a+ 6), = a+ by 


Tn particular, the components of a + 5 in the directions of the co-ordinate 
axes are a, + by, dz + bg, 3 + by. 

To form the sum of two vectors we accordingly have the following 
simple rule. The components of the sum are equal to the sums of the corre- 
sponding components of the summands. 

Every point P with co-ordinates (x, y, z) may be determined by the 
position vector from the origin to P, whose componente in the directions of 
the axes are just the co-ordinates of the point P. We take three unit 
vectors in the directions of the three axes, e, in the x-direction, e, in the 
y-direction, @, in the z-direction. If the vector v has the components 
®%, Vg, Vg, then 

VD = V,@1 + Vgeq + Vgey 


We call v, = 0,61, V2 == U2, Ug = Vg the vector components of v. 
Using the projection theorem stated above, we easily obtain the irans- 
formation formule which determine (2’, y’, z’), the co-ordinates of a given 
point P with respect to the axes Ox’, Oy’, Oz’, in terms of (x, y, 2), its co- 
ordinates with respect to another set * of axes Ox, Oy, Oz which has the 
same origin as the first set and may be obtained from it by rotation. The 
three new axes form angles with the three old axes whose cosines may be 


*It is to be noted that in accordance with the convention adopted on 
p. 3 both systems of axes are to be right-handed. 
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expressed by the following scheme, where for example y, is the cosine of 
the angle between the z’-axis and the z-axis: 


Lelylz 

x’ | «| Bl va 

y | 2 | Bol v2 

z’|as| Bs| Ys 
From P we drop perpendiculars to the axes Ox, Oy, Oz, their feet being 
P,, Ps, Ps (of. fig. 1, p. 1). The vector from O to P is then equal to 
the sum of the vectors from O to P,, from O to P,, and from O to P;. The 
direction cosines of the «’-axis relative to the axes Ox, Oy, Oz are %, By Y1 
those of the y’-axis 0, Bz, ‘Y2, and those of the z’-axis xs, By, Ys. By the 
projection theorem we know that 2’, which is the component of the vector 


—e 
OP in the direction of the x’-axis, must be equal to the sum of the com- 
> 3S 
ponents of OP,, OP,, OP, in the direction of the 2’-axis, so that 
v= ye + By + v2, 


for «,% is the component of x in the direction of the x’-axis, and soon. 
Carrying out similar arguments for y’ and z’, we obtain the transformation 
formule 

w= oe + By + Y2 

y! = we + By + Ye 


z= age + Bay + 19% 
and conversely 


w= ye! + ay’ + age” 

y Biz’ + Bay’ + Bae’ 

z= ye + vey’ + Ys’ 

Since the components of a bound vector v in the directions of the axes 
are expressed by the formule 

O = Le — Uy 
= 2-1 
Us = 2% — Mp 


in which (2,, y,, 2%) are the co-ordinates of the initial point and (x, y2, 22) 
the co-ordinates of the final point of v, it follows that the same trans- 
formation formule hold for the components of the vector as for the 
co-ordinates: 

01 = &0 + Bie + ils 

U2’ = 2% + Bae + Yas 

Vg = Og + Bare + Yat 


3. Sealar Multiplication of Vectors. 


Following conventions like those for the addition of vectors, we now 
define the product of a vector v by a number c: if v has the components 
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1) Vy Vs, then cv is the vector with components cv, Cv, cvs. This de- 
finition agrees with that of vector addition, for v-+ v= 2v, u+v+ 
= 3v, and so on. If c¢ > 0, cw has the same direction as v, and is of length 
c|v|; ife < 0, the direction of cv is opposite to the direction of v, and its 
length is (—c)|v|. If c= 0, we see that cv is the zero vector with the 
components 0, 0, 0. 

We can also define the product of two vectors 2 and v, where this “ multi- 
plication ” of vectors satisfies rules of calculation which are in part similar 
to those of ordinary multiplication. There are two different kinds of 
vector multiplication. We begin with scalar multiplication, which is the 
simpler and the more important for our purposes. 

By the scalar product * uv of the vectors u and v we mean the product 
of their absolute values and the cosine of the angle 8 between their directions: 


uv = ||| v| cosd. 


The scalar product, therefore, is simply the component of one of the 
vectors in the direction of the other multiplied by the length of the second 
vector. 

From the projection theorem the distributive law for multiplication, 


(4+ v)w= uw-+ vw, 


follows at once, while the commutative law, 


uv = VU, 


is an immediate consequence of the definition. 

On the other hand, there is an essential difference between the scalar 
product of two vectors and the ordinary product of two numbers, for the 
product can vanish alihough neither factor vanishes. 

If the lengths of v and v are not zero, the product uv vanishes tf, and 
only if, the two vectors u and v are perpendicular to one another. 

In order to express the scalar product in terms of the components of 
the two vectors, we take both the vectors « and v with initial points at 
the origin. We denote their vector components by 2, uw, wu, and 
Uy, Vay Vg respectively, so that #2 = uw, + w+ wu, and.v= v, + v,+ v3. 
In the equation wv = (2, + m+ us)(¥, + ¥, + Vs) we can expand the 
product on the right in accordance with the rules of calculation which 
we have just established; if we notice that the products 2 v9, 24,U , 14,0, 
UyVg, 45%, and #3, vanish because the factors are perpendicular to one 
another, we obtain wv = 1,¥,-+ U,V. + t%43¥3. Now the factors on the 
right have the same direction, so that by definition u,v, = uv, &c., 
where %, Ug, Ug and 0%, V2, Vs are the components of 2 and v respectively. 
Hence 

UV = UzV, + Ugd, + Ugds. 


This equation could have been taken as the definition of the scalar product, 
and is an important rule for calculating the scalar product of two vectors 


* Often called the inner product. 
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given in terms of their components. In particular, if we take # and v as 
unit vectors with direction cosines 0, a %3 and B, Be, Bs respectively, 
the scalar product is equal to the cosine of the angle between # and v, 
which is accordingly given by the formula 


cos 8 = «8, + aR. -+ asBy. 


The physical meaning of the scalar product is exemplified by the fact, 
proved in elementary physics, that a force f which moves a particle of unit 
mass through the directed distance v does work amounting to /v. 


4. The Equations of the Straight Line and of the Plane. 


Let a straight line in the zy-plane or a plane in xyz-space be given. 
In order to find their equations we erect a perpendicular to the line (or 


¥. 
n 


(ee 


Fig. 10.—The equation of a straight line 


the plane) and specify a definite “ positive direction along the normal”, 
perpendicular to the line (or plane); it does not matter which of the two 
possible directions is taken as positive (cf. fig. 10). The vector with unit 
length and the direction of the positive normal we denote by #. The points 
of the line (or plane) are characterized by the property that the position 
vector x from the origin to them has a constant projection p on the direc- 
tion of the normal; in other words, the scalar product of this position 
vector and the normal vector ~ is constant. If «, B (or «, B, y) are the 
direction cosines of the positive direction of the normal, that is, the com- 
ponents of , then 
an -+- By— p=0 


(or =a + By + ye — p=0) 


is the required equation of the line (or plane). Here p has the following 
meaning: the absolute value | p| of p is the distance of the line (or plane) 
from the origin. Moreover, p is positive if the line (or plane) does not 
pass through the origin and # is in the direction of the perpendicular 
from the origin to the line (or plane); p is negative if the line (or plane) 
does not pass through the origin and » has the opposite direction; p is 
zero if the line (or plane) passes through the origin. Conversely, if «, B 
(or «, ®, y) are direction cosines, this equation represents a line (or plane) 
at a distance p from the origin, whose normal has these direction cosines. 
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The expression ax + By — p (or ax + By + yz — p) on the left-hand 
side of this so-called normal or canonical form of the equation of the 
straight line (or plane) also has a geometrical meaning for any point P 
(x, y) not lying on the line (or plane). Since ax -+ By (or ax -+- By + yz) is 
the projection of the position vector from O to P on the normal, we see 
at once that the expression ax + By — p (or ax+ By + yz— p) ta the 
perpendicular distance of the point P from the line (or plane) and is positive 
for points on one side of the line or plane (namely, that on which the normal 
ts positive) and negative for points on the other side. 

From the canonical form of the equation we obtain other forms of 
equation for the straight line (or plane) by multiplying by an arbitrary 
non-vanishing factor. Conversely, an arbitrary linear equation 


Az + By + D=0 (or Ax + By + Cz + D=0) 


represents a straight line (or plane) provided the coefficients A, B (or 
A, B, C) are not all zero.* In the second of these equations, for example, 


we may divide by ~/ A® + B? + C? and put 


A B 
‘= Tapepo PO Vepepo 


Cc D 
Y = ———— = SOS, 
Vat eto ~~ Jat B+ oF 
In this way we obtain an equation which is seen to represent a plane at 
a distance p from the origin, whose normal has the direction cosines 
a, B, y. Corresponding remarks hold for the equation of the straight line. 


A straight line in space may be determined by any two planes passing 
through the line. For a line in space we thus obtain two linear equations 


A,z+ By + Cyz+ D, = 0, 
A + By + Cz+ D,=9, 


which are satisfied by (x, y, z), the co-ordinates of any point on the line. 
Since an infinite number of planes pass through a given line, this repre- 
sentation of a line in space is not unique. 

Frequently it is more convenient to represent a line analytically in 
parametric form by means of a parameter t. If we consider three linear 
functions of ¢, 

w= a, + byt, 
y = a, + bt, 
z= a3 + bgt, 


where the 6’s are not all zero, then as ¢ traverses the number axis the point 
(x, y, z) describes a straight line. This we see at once by eliminating ¢ 
between each pair of equations, whereby we obtain two linear equations 
for x, y, 2. 


*If A = B= 0 (or A = B = C = 0), D must also be zero, and any point 
of the plane (or of space) satisfies the equation. 
2° (8912) 
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The direction cosines «, 8, y of the line in its parametric form are 
proportional to the coefficients b,, 6,, b3. For these direction cosines are 


O 


Fig. 11.—Parametric representation of a straight line passing through two points 


proportional (cf. fig. 11) to 7, — %2, y; — Yo 2% — %, the differences of the 
co-ordinates of two points P,, P, with co-ordinates 
Wy = A+ Dt, y= Oy + Og, 2% = ag t Ost 
and 
Bq == Ay + Dye, Ya = My + Ooty, 2% = As + gtr. 
Hence 
P,P, cos 8, = 1% — %, = 6,(t, — 4), 
P,P, cos 8, = y2 — Y1 = bolt, — by), 
P,P, cos 33 = % — 2% = b3(t, — &), 


where P,P, denotes the length of the segment P,P,. Consequently 


t, — t 
a= eb, = pbs, = od. where 9 = 2. ), 
eb, B= pb, y= pbs ( e =) 


Since the sum of the squares of the direction cosines is unity, it follows that 


b be. bs 


1 = . oe 
Virb irrhe PW aVeet ieee TO LVorp op oe 


where the double sign of the square root corresponds to the fact that we 
can choose either of the two possible senses on the line. 

By means of the direction cosines we can easily bring the parametric 
representation of the line into the form 


B= % + aT, 
¥=Y% + Br, 
2=% + Yt 


where (%y, Y, %) is a fixed point on the line; the new parameter + is con- 
nected with the previous parameter ¢ by the equation 


y+ at = a, + Dye. 
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From the fact that o? + 8? + y?= 1 it follows that 
v4 = (x — a)" + (y — Yo)* + (2 — %)* 


Hence the absolute value of + is the distance between (x, yy, 2) and 
(x, y, 2). The sign of + indicates whether the direction of the line is from the 
point (xo, Yo, %) to the point (x, y, 2), or vice versa; in the first case t is 
positive, in the second negative. 

From this we obtain a useful expression for (2, y, 2), the co-ordinates 
of a point P on the segment joining the points P4(2», Yo, 2p) and P,(x, ¥4, 2), 
namely, 


B= yy + AW, Y= AYot Yr, 2 =Agto + Ary 


where Ay and A, are positive and 4) + A; = 1. If + and +, denote the dis- 


tances from P, of the points P and P, respectively, we find that 4)—= 1— as 
Ty 
7 


and A, = For if we calculate «, say, from x, = %) + at,, and sub- 


% 
stitute this value, « = (x, — %)/T,, in the equation z = 2% + at, we obtain 
the expression given above. 

Let a straight line be given by 


2= 2+ at, 
=%t+ Br, 
z= % + YT 
We now seek to find the equation of the plane which passes through the point 
(%, Yor 29) and is perpendicular to this line. Since the direction cosines of 
the normal to this plane are a, B, y, the canonical] form of the required 
equation is 
ax+ By + yze—p=0, 


and since the point (2, Yo, 2%) lies on the plane 
P = a% + By + y2o- 


The equation of the plane through (x, Yo, 2) perpendicular to the line 
with direction cosines «, B, y is therefore 


a(% — a) + By — Yo) + y(% — 2%) = 0. 


In the same way, the equation of a straight line in the zy-plane which 
passes through the point (2, yo) and is perpendicular to the line with 
direction cosines a, B is 


a(a — x) + Bly — Yo) = 0. 
Later we shall need a formula for 8, the angle between two planes given 
by the equations 
ax + By + yz —p =0. 
aa+ By + yz— p= 0. 
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Since the angle between the planes is equal to the angle between their 
normal vectors, the scalar product of these vectors is cos3, so that 


cos 8 = aa’ + BR’+ yy’. 
In the same way, for the angle 3 between the two straight lines 
ox -+ By— p= Oand oa + Py — p’=0 


in the zy-plane we have 
cos § = aa’-+ BP’. 


EXxaMP.LEs 


1. Prove that the quantities a, «2, ...» Ys (p. 6), defining a rotation 
of axes, satisfy the relations 


O1%_ + BiB. + Y1¥2 = 0, a? + B+ y?= 1, 
Gets + Bobs + Y2¥s = 0, a? + Bo? + y.? = 1, 
G30, + BsBi + Ysv1 = 0, as? + Bs? + ys? = 1. 


2. If aand 8 are two vectors with initial point O and final points A and 
B, then the vector with O as initial point and the point dividing AB in the 
ratio 6: 1 — 6 as final point is given by 


(1 — 6)a + 08. 


3. The centre of mass of the vertices of a tetrahedron PQRS may be 
defined as the point dividing MS in the ratio 1:3, where M is the centre 
of mass of the triangle PQR. Show that this definition is independent of 
the order in which the vertices are taken and that it agrees with the general 
definition of the centre of mass (Vol. I, p. 283). 

4. If in the tetrahedron PQRS the centres of the edges PQ, RS, PR, 
QS, PS, QR are denoted by A, A’, B, B’, C, C’ respectively, then the lines 
AA’, BB’, CC’ all pass through the centre of mass and bisect one another 
there. 

5. Let P,,..., P, be m arbitrary particles in space, with masses 
My, Mz .. +5 My, respectively. Let G be their centre of mass and let 
py «++, fy Aenote the vectors with initial point G and final points 
Py, ...,P,. Prove that 


Mp + Mop, + +++ Myady = 0. 
2. Tue Arka or A TRIANGLE, THE VOLUME OF A TETRAHEDRON, 
THE VECTOR MULTIPLICATION OF VECTORS 


l. The Area of a Triangle. 


In order to calculate the area of a triangle in the zy-plane we imagine 
it moved parallel to itself until one of its vertices is at the origin; let the 
other two vertices be P,(x,, y;) and P,(22, yg) (cf. fig. 12). We write down 
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the equation of the line joining P, to the origin in its canonical form 
—" nas 
4. x + tL _y = 0; 
Vattyt Vetta” 
hence for the distance hf of the point P, from this line we have (except 
perhaps for sign) the expression 
hice. —Yi% Yo 
x3 + y? Vx? + uF 
Since the length of the segment OP, is \/z + 9,3, we find that twice the 


, oR 
# 
0 2 0 


Fig. 12.—To illustrate the method for Fig. 13.—Determination of the sign of the 
finding the area of a triangle area of a triangle 


& 


area of the triangle, which is the product of the “base” OP, and the 
altitude h, is given (except perhaps for sign) by the expression 


2A = BY_ — TY 


This expression can be either positive or negative; it changes sign if 
we interchange P, and P,. We now make the following assertion. The 
expression A has a positive or negative value according as the sense in which 
the vertices OP,P, are traversed is the same as the sense of the rotation 
associated with the co-ordinate axes, or not. Instead of proving the fact by 
more detailed investigation of the argument given above, which is quite 
feasible, we prefer to prove it by the following method. We rotate the tri- 
angle OP,P, about the origin O until P, lies on the positive x-axis. (The 
case in which O, P,, P, lie on a line, so that A = (214, — 2241) = 0, can be 
omitted.) This rotation leaves the value of A unaltered. After the rotation 
P, has the co-ordinates 2,’ > 0, y,’ = 0, and the co-ordinates of the new 
P, are 2,’ and y,’. The area of the triangle is now 


1 
A= 5 %y'Yo'» 


and therefore has the same sign as y,’. The sign of ys’, however, is the 
same as the sign of the sense in which the vertices OP,P, are traversed 
(cf. fig. 13). Our statement is thus proved. 
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For the expression 2,y, — 2,y,, which gives twice the area with its 
proper sign, it is customary to introduce the symbolic notation 


XU Xs : 

Yr Ya 

which we call a two-rowed determinant, or determinant of the second order. 
If no vertex of the triangle is at the origin of the co-ordinate system, 


e.g. if the three vertices are (x9, Yo), (1, Y1)s (es Ye), by moving the axes 
parallel to themselves we obtain the formula 
1 


2 
for the area of the triangle. 


Yo — efi = 


Hy — Uy Ly — My 
¥— Yo Ye— Yo 


2. Vector Multiplication of two Vectors. 


In addition to the scalar product of two vectors we have the important 
concept of the vector product.* The 
vector product [@6] of the vectors @ 
and & is defined as follows (cf. fig. 14): 

We measure off @ and 6 from a 
[a b} point O. Then @ and 6 are two sides 
of a parallelogram in space. The vector 
product [@5]= ec is a vector whose 
length is numerically equal to the area 
of the parallelogram and whose direc- 
tion is perpendicular to the plane of the 
parallelogram, the sense of direction 
being such that the rotation from a to 

6 and c = [a6] is right-handed. (That 

is, if we look at the plane from the 

fina] point of the vector c, we see the 


Fig. 14.—Vector product of two ; : 7 
vectors 4 and 6 shortest rotation from the direction of a 


to that of 5 as a positive rotation.) If 
aand 6 lie in the same straight line, we must have [a5] = 0, since the 
area of the parallelogram is zero. 


Rules of Calculation for the Vector Product. 


(1) If a + O and 5 + 0, then [a6] = 0 if, and only if, a@ and 5 have the 
same direction or opposite directions. 
For then, and only then, the area of the parallelogram with @ and 6 
as sides is equal to zero. 
(2) The equation 
[a6] = —[da] 
holds, 


* Often called the outer product; other notations in use for it are @ x 6, 
ans. 
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This follows at once from the definition of [@]. 
(3) If a and 6 are real numbers, then 


[ea bd] = ab [ad]. 


For the parallelogram with sides aa and 06 has an area ab times as 
great as that of the parallelogram with sides @ and 5 and lies in the same 
plane as the latter. 

(4) The distributive law holds: 


[a(6 + c)] = [a6] + [ac], [(2 + c)a] = [ba] + [ca]. 


We shall prove the first of these formule; the second follows from it 
when rule (2) is applied. 

We shall now give a geometrical construction for the vector pro- 
duct [25] which will demonstrate the truth of the distributive law 
directly. : 

Let # be the plane perpendicular to @ through the point O. We project 


B 
b 


0 


Fig. 15 —To show that [#6] = [ab 


6 orthogonally on FZ, thus obtaining a vector 6’ (cf. fig. 15). Then [a6] 
= [a6], for in the first place the parallelogram with sides a and & has the 
same base and the same altitude as the parallelogram with sides @ and 
8’; and in the second place the directions of [@8’] and [@8] are the same, 
since @, 5, 5’ lie in one plane and the sense of rotation from @ to 5’ is the 
same as that from @ to &. Since the vectors a and 6 are sides of a rect- 
angle, the length of [@b’] = [26] is the product |a||6’|. If, therefore, 
we increase the length of 5’ in the ratio |@|:1, we obtain a vector 5” 
which has the same length as [@6’]. But [a6] = [@d’) is perpen- 
dicular to both @ and 6, so that we obtain [a6] = [@’] from 6” by a 
rotation through 90° about the line a. The sense of this rotation must be 
positive when looked at from the final point of a. Such a rotation we shall 
call a positive rotation about the vector a as axis. 

We can therefore form [22] in the following way: project & orthogon- 
ally on the plane 2, lengthen it in the ratio | a@|:1, and rotate it positively 
through 90° about the vector a. 

To prove that [a(5+ c)] = [@d]+ [ac] we proceed as follows: 5 
and c are the sides OB, OC of a parallelogram OBDO, whose diagonal OD 
is the sum 6+ c. We now perform the three operations of projection, 
lengthening, and rotation on the whole parallelogram OBDC instead of on 
the individual vectors 6, c, 6-+ ¢; we thus obtain a parallelogram 
OB,D,C, whose sides OB,, OC, are the vectors [@5] and [ac] and whose 
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diagonal is the product [a(S + c)]. From this the equation [a6] + [ac] 
= [a(5 + c)] clearly follows (cf. fig. 16). 


Fig. 16.—Distributive law for the vector product of two vectors a and b 


(5) Let @ and 5 be given by their components along the axes, a, a2, as 
and 5,, b,, bs respectively. What is the expression for the vector product 
[a5] in terms of the vector components? 

We express @ by the sum of its vector components in the directions of 
the axes. If e,, e,, eg are the unit vectors in the directions of the axes, 
then 

@= Ge, + Agey + Bsa 
and similarly 
5 = be, + be, + bse. 


By the distributive law we obtain 


[25] = [(a,2;) (b¢1)] + [(a12,) (bgee)] + [(a14,) (bs@s)) 
+ [(ag¢2) (61¢1)] + [(@22) (02¢2)] + [(@2¢2) (b3¢5)] 
+ [(a@s@3) (b121)] + [(43s) (b2@2)] + [(ase5) (bges)}, 


which by rules (1) and (3) may also be written 


[ab] = a,b,[e,¢2] + ab,[e,¢] + ab,[e,€,] 
+ agbs[e,¢,] + a 9b,[ese,] + agb.[ese,]. 


Now from the definition of vector product it follows that 
@, = [@2¢3] = —[es@q], ee: = [23¢,] = —[e,e3], @3 = [e,e,] = —[e,e,}. 
Hence 
[@6] = (aby — agb,)e, + (ab; — a1b3)e, + (ab, — ab, ey. 
The components of the vector product [ab] = c are therefore 
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G, ag 


by bs 


¢ = 


, Q= 


In physics we use the vector product of two vectors to represent a 
moment. A force f acting at the final point of the position vector x has the 
moment [| fx] about the origin. 


3. The Volume of a Tetrahedron. 


We consider a tetrahedron (cf. fig. 17) whose vertices are the origin 
and three other points P,, P,, P; with co-ordinates (1, ¥4, 21)» (Xa Yar 22)» 


x 


y 


Fig. 17.—Determination of the volume of a tetrahedron 


(2g Yq» %) Fespectively. To express the volume of this tetrahedron in terms 
of the co-ordinates of its vertices we proceed as follows. The vectors 
x, = OP, and x, = OP, are sides of a triangle whose area is half the 
length of the vector product [x,%x,]. This vector product has the direction 
of the perpendicular from P, to the plane of the triangle OP,P,; h, the 
length of this perpendicular (the altitude of the tetrahedron), is therefore 
given by the scalar product of the vector +, = OP, and the unit vector 
in the direction of [x,x,]; for 4 is equal to the component of OP, in the 
direction of [x]. Since the absolute value of [12] is twice the area A 
of the triangle OP,P,, and since the volume V of the tetrahedron is equal 
to 44h, we have 


V = 3 ([%,%.]% 5). 
Or, since the components of [x,x,] are given by 


my YM , 
Te Y2 


w 4% co gs 
> 


Yo % 


Ze 


we can write 
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a | + Ys % YH 
Zo, 


v= {2 \. 
6 Ye % Ye 


This also holds for the case in which O, P,, P, lie on a straight line; in 
this case, it is true, the direction of [,,] is indeterminate, so that h can 
no longer be regarded as the component of OP, in the direction of [x1], 
but nevertheless A = 0, so that V = 0, and this follows also from the above 
expression for V, since in this case all the components of [2,2,] vanish. 

Here again the volume of the tetrahedron is given with a definite sign, 
as the area of the triangle was on p. 13; and we can show that the sign 
is positive if the three axes OP,, OP, ,OP, taken in that order form a system 
of the same type (right-handed or left-handed, as the case may be) as the 
co-ordinate axes, and negative if the two systems are of opposite type. 
For in the first case the angle § between [x,x,] and x, lies in the interval 


% age 
% 


y 
Xe 


osss - and in the second case in the interval 5 <8 <1, as follows 


immediately from the definition of [,2,], and V is equal to 


| [322] | | x3 | cosd. 
The expression 
2 


+ 2 


v4 | 


2 Xs tT Ye 


occurring in our formule may be expressed more briefly by the symbol 
yn a 
% Yo 21, 
3 Ys 23 
which we call a three-rowed determinant, or determinant of the third order. 
Writing out the two-rowed determinants in full, we see that 
1H % 
Ly Yo 2 | = LyYyhq — LyYor% 1 LoYgt1 — WYstq + TYots — BeYs%s 
Zs Ys 2 
Just as in the case of the triangle, we find that the volume of the tetra- 
hedron with vertices (Xp, Yo, 2)» (Ts Yi» 21)» (es Yor Za)s (Lar Yur 2g) i8 
1| 27% W— Yo %— % 
V=5% %—% Yo-Yo %— % |. 
%y— 2% Ys— Yo %— % 


EXaMpPtes * 


1. What is the distance of the point P(x, ¥q, 29) from the straight line 
1 given by 
aw=a+b, y=et+d, z=ea+f? 


* The more difficult examples are indicated by an asterisk. 
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2*, Find the shortest distance between two straight lines / and /’ in 
space, given by the equations 


a=a+tb a=a'tt b’ 
y=a+d and y=ct+ad 
z=ead+f z= ett fr. 


3. Show that the plane through the three points (2, 41, 21), (%a> Yes 2e)s 
(Xa Yg> 2g) is given by 
%— % w-y Bz 
t— & Yo—-Y #_—2|= 9. 
tz — % ¥s—— ¥ %— @ 
4. In a uniform rotation let («, 8, y) be the direction cosines of the axis 


of rotation, which passes through the origin, and w the angular velocity. 
Find the velocity of the point (2, y, z). 


5. Prove Lagrange’s identity 
[xP = | x P|» P — (xr). 
6. The area of a convex polygon with the vertices P,(2,, 91), Po(%2, Yo); 
«+s Paty Yn) is given by half the absolute value of 
x, 


T1 Xy 


Yn-1 Yn 


% MX + 
41 Ye 


T, Xs 
Ye Ys 


T,X 
Yn Yr 


+e t 


3. SmumpLE THEOREMS ON DETERMINANTS OF THE SECOND 
AND THIRD ORDER 


1. Laws of Formation and Principal Properties. 


The determinants of the second and third order occurring in the cal- 
culation of the area of a triangle and the volume of a tetrahedron, together 
with their generalization, the determinant of the nth order, or n-rowed deter- 
minant, are very important in that they enable formal calculations in all 
branches of mathematics to be expressed in a compact form. Here we 
shall develop the properties of determinants of the second and third order; 
those of higher order we shall need but seldom. It may, however, be 
pointed out that all the principal theorems may be generalized at once 
for determinants with any number of rows. For the theory of these we 
must refer the reader to books on algebra and determinants.* 

By their definitions (pp. 14, 18) the determinants 


b abe 
eal om lees 
: g hk 


*Cf. eg. H. W. Turnbull, The Theory of Determinants, Matrices, and In- 
variants (Blackie & Son, Ltd., 1929). 
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are expressions formed in a definite way from their elements a, b, c, d and 
a, b, c, d, e, f, g, h, k respectively. The horizontal lines of elements (such 
as d, e, f in our example) are called rows and the vertical lines (such as 
¢, f, &) are called columns. 

We need not spend any time in discussing the formation of the two- 
rowed determinant 


For the three-rowed determinant we give the “diagonal rule” which 
exhibits the symmetrical way in which the determinant is formed: 


N25 
CON 
Me EN 


ae (4) 


We repeat the first two columns after the third and then form the 
product of each triad of numbers in the diagonal lines, multiply the pro- 
ducts associated with lines slanting downwards and to the right by -+1, 
the others by —1, and add. In this way we obtain 


coe gpa Meet Bio + oa 
5 Bok " —ceg — afh — bdk. 


We shall now prove several theorems on determinants: 
(1) If the rows and columns of a determinant are interchanged, the value 
of the determinant is unaltered. That is, 


abe adg 
de fi= e hi. 
g hk k 


c f 


This follows immediately from the above expressions for the determinants. 
(2) If two rows (or two columns) of a determinant are interchanged, the 
sign of the determinant is altered, that is, the determinant is multiplied by 
—1. 
In virtue of (1) this need only be proved for the columns, and it can 
be verified at once by the law of formation of the determinant given above, 
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(3) In section 2 (p. 18) we introduced three-rowed determinants by the 
equations 


1 mH 

W. %4 y my 
x, Z| = 2 25° < 
a act 5 lye 2 + Ys 2%, T % % Yo 
% Ys % 

Using (2), we write this in the form 

Um % 

4 My ma 
x Zp | == & = ° 
2 Ye 2 3 ly, *| Y3 a % + % te yal? 
% Ys *% 


then in the determinants on the right the elements are in the same order 
as on the left. If we interchange the last two rows and then write down 
the same equation, using (2), we obtain: 


- n : Bry m1 % 2 ce mm 
acca) We "Ilys 2 2 lay 2 a %_ Ys 
% Ys 2 
and similarly 
i : . ep Yo %2) UV, %s Ts | 
Bees TlYs % 1) ag %, t Ys 
Ys % 


We call these three equations the expansion in terms of the elements of the 
third row, the second row, and the first row respectively. By interchanging 
columns and rows, which according to (1) does not alter the value of the 
determinant, we obtain the expansion by columns, 


x. 
el | afin M4 | 4 
Bogee aa 1ly3 % = Ilys 2s Sly, fay 
Tt Ys 2% 
m1 Mm 71 Ly Xe Ly My % 
Te Ye %i = —-N ts % + % ts z5| _ 2) 
%% Ys % 

2 
og eis yl wali, 4 4), m1 
raat 11a Ys "lay Ys Slate Ye 
ts Ys % 


An immediate consequence of this is the following theorem: 

(4) If all the elements of one row (or column) are multiplied by a number 
9, the value of the determinant is multiplied by p. 

From (2) and (4) we deduce the following: 

(5) If the elements of two rows (or two columns) are proportional, that is, 
if every element of one row (or column) is the product of the corresponding 
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element in the other row (or column) and the same factor p, then the de- 
terminant ts equal to zero. 

For according to (4) we can write the factor outside the determinant. 
If we then interchange the equal rows, the value of the determinant is 
unchanged, but by (2) it should change sign. Hence its value is zero. 

In particular, a determinant in which one row or column consists 
entirely of zeros has the value zero, as also follows from the definition 
of a determinant. 

(6) The sum of two determinants, having the same number of rows, which 
differ only in the elements of one row (or column) is equal to the determinant 
which coincides with them in the rows (or columns) common to the two de- 
terminants and in the one remaining row (or column) has the sums of the 
corresponding elements of the two non-identical rows (or columns). 

For example: 


abe amie abtme 
de fitid nu fl=id e+n f 
g hk g pk g h+pk 


For if we expand in terms of the rows (or columns) in question, which 
in our example consist of the elements 6, e, 4 and m, n, p respectively, and 
add, we obtain the expression 
ae 


ad f 


d 
(—o—m)|? tle etm|* tft aap) 


which clearly is just the expansion of the determinant 


abt+m ec 
d etn ff 
g htp &k 


in terms of the column b+ m, e+, 4+. This proves the state- 
ment. 

Similar statements hold for two-rowed determinants. 

(7) If to each element of a row (or column) of a determinant we add the 
same multiple of the corresponding element of another row (or column), the 
value of the determinant is unchanged. 

By (6) the new determinant is the sum of the original determinant and 
a determinant which has two proportional rows (or columns); by (5) 
this second determinant is zero.* 


* The rule for expansion in terms of rows or columns may be extended to 
define determinants of the fourth and higher order. Given a system of sixteen 
numbers 

a b Gy dy 


@, by Cy dy 
a, by Cy dy f’ 
a, by ty 


for example, we define a determinant of the fourth order by the expression 
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The following examples illustrate how the above theorems are applied 
to the evaluation of determinants. We have 
aoo 
0 e 0O| =aek, 
008k 


as we can prove by the diagonal rule. A determinant in which the 
elements in the so-called principal diagonal alone differ from zero is equal 
to the product of these elements. 

Evaluation of a determinant: 


1 1-1 2 00 
1-1 1/= 1 —1 1] (second row added to the first), 
-—l1 1 1 -1 141 
2 00 11 
1-1 lj= 2 ll (expansion in terms of the first row), 
1141 
Hence 1 431-1 
}-—-1l lj=-4 
-1 21 #1 
Another example is 
lez #2 1 x a 1 x oad 
Ly ¥l=|0 y—au xY—etl=—)10 y—a y— ah). 
lz 2 1 z 2 lo z—a 2 — 2% 
If we now expand this in terms of the first column we obtain 
y—a pe Ll yt+2(_a_ = = 
ae A a|= 0-2) (@—x)], “es {y — x) (z— a) (z — y). 


2. Application to Linear Equations. 


Determinants are of fundamental importance in the theory of linear 
equations. In order to solve the two equations 


az + by = A, 
ca + dy = B, 


for z and y, we multiply the first equation by c and the second by a, and 


by Cg dy G@, Cy dy a, by dy @z by Ci 
a, |b, ¢3 dy| —b |G, cy ds| +e, |G, bs dz] —dy | as bs cg}; 
bg % a a & a ah a, by &% 

and similarly we can introduce determinants of the fifth, sixth, ..., mth order 


in succession. It turns out that in all essential properties these agree with the 
determinants of two or three rows. Determinants of more than threo rows, 
however, cannot be expanded by the “ diagonal rule”. We shall not consider 
further details here. 
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subtract the second from the first; then we multiply the first equation by 
d@ and the second by b and subtract. We thus obtain 

(be — ad)jy = Ac — Ba 

(ad — be)x = Ad — Bb, 


or b A b a A 
d!| |B d| Bi’ 
Hf we assume that the determinant 
a b 
ed 
is different from zero, these equations at once give the solution 
A ob aA 
Bd _ fe B 
ala. bl fe OT 
e od c od 


which can be verified by substitution. If, however, the determinant 
a b 
ed 


vanishes, the en 
lA a 
Bd 


a A 
B 


ieee 
Ye al= 


aa 


would lead to a contradiction if either of the determinants and 


were different from zero. If, however, 


A bl] _ 
B dj |e 
our formulz tell us nothing about the solution. 

We therefore obtain the fact, which is particularly important for our 
purposes, that a system of equations of the above form, whose determinant is 
different from zero, always has a unique solution. 

If our system of equations is homogeneous, that is, if A = B= 0, our 


calculations lead to the solution x = 0, y = 0, provided that : : | + 0. 


Ba 


A 
= 0, 
2| 


For three equations with three unknowns, 
de+eayt+fe=B 
gz + hy + ke= 0, 
a similar discussion leads to a similar conclusion. We multiply the first 
> ¢) the third by F 
ef 


equation by F f , the second by — hk? » and 


hk 
add, thus obtaining 
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{els a]—la al tele g)} 
+ofels lea elt le gl} 
elle ala alte hola abak altel st 


But by our formule for the expansion of a determinant in terms of the 
elements of a column, this equation can be written in the form 


abe b be c bc Abe 
wide fityle e fl+zif e fl[=|B e fi. 
g hk hhk khk Chk 
By rule (4) the coefficients of y and z vanish, so that 
abe Abe 
aj\d e fi=|B e fl. 
ghk Chk 


In the same way we derive the equations 


abe aAe 
y{d e fi=j|d B fl, 
ghk g9 Ck 
abe abA 
zi\d e fi=|de Bl 
g ku k gh @ 
[If the determinant 

abe 

def 

g hk 


is not zero, the last three equations give us the value of the unknowns. 
Provided. that this determinant is not zero, the equations can be solved 
uniquely for x, y, z. If the determinant is zero, it follows that the right- 
hand sides of the above equations must also be zero, and the equations 
therefore cannot be solved unless A, B, C satisfy the special conditions 
which are expressed by the vanishing of every determinant on the 
right. 

If, in particular, the system of equations is homogeneous, so that 
4=B=C= 0, and if its determinant is different from zero, it again 
follows that = y = z= 0. 

In addition to the cases above, in which the number of equations is 
equal to the number of unknowns, we shall occasionally meet with 
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systems of two (homogeneous) equations with three unknowns, e.g. 
az + by + cz= 0, 
dz + ey + fze= 0. 
If the three determinants 
be ca 
ef fd 
are not all zero, if, for example, D, + 0, our equations can first be solved 
for x and y; this gives 


ab 


a de 


», D= » Da= 


or 
w@:y:z= D,: D,: D3. 


Geometrically this has the following meaning: we are given two vectors 
u and v with the components a, b, c and d, e, f respectively. We seek a 
vector x which is perpendicular to # and v, that is, which satisfies the 


equations 
ux=0, vx=0. 


Thus x is in the direction of [wv]. 


EXaMPLes 
1. Show that the determinant 
ab 
de 
g h 


Qa 


ay 


can always be reduced to the form 


a 0 0 

0p 0 

00 ¥ 
merely by repeated application of the following processes: (1) interchang- 
ing two rows or two columns, (2) adding a multiple of one row (or column) 
to another row (or column). 


2. If the three determinants 


a, a 
5, 


Gy Cy 


a, a, 
Go 


> > 


do not all vanish, then the necessary and sufficient condition for the 
existence of a solution of the three equations 

a,x + ay=d 

6,2 + by = e 

oe + cy = f 
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a% a d 
is D=\|b, & ej=0. 
4 ¢ f 
3. State the condition that the two straight lines 
a= at +b, a= ot+d, 
y=at+b, and y= ot+dy 
2 == ast + bs z= et + dz 


either intersect or are parallel. 


4*, Prove the properties (1) to (7), given on pp. 20-22, for deter- 
minants of the fourth order (defined on p. 22 (footnote)). 

5. Prove that the volume of a tetrahedron with vertices (2, y,, 2%), 
(a5 Yor 22)s (Xas Yar Zs)» (Ta Yar 24) 18 given by 


ym 4 
1 Fy Ye 2 
©l% Ys 2 
Tq Yq % 


et 


4, AFFINE TRANSFORMATIONS AND THE MULTIPLICATION 
oF DETERMINANTS 


We shall conclude these preliminary remarks by discussing the simplest 
facts relating to the so-called affine transformations; at the same time 
we shall obtain an important theorem on determinants, 


1, Affine Transformations of the Plane and of Space. 


By a mapping or transformation of a portion of space (or of a plane) 
we mean @ law by which each point has assigned to it another point of 
space (or point of the plane) as image point; the point itself we call the 
original point, or sometimes the model (in antithesis to the image). We 
obtain a physical expression of the concept of mapping by imagining that 
the poriion of space (or of the plane) in question is occupied by some 
deformable substance and that our transformation represents a deformation 
in which every point of the substance moves from its original position 
to a certain final position. 

Using a rectangular system of co-ordinates, we take (x, y, z) as the co- 
ordinates of the original point and (2, y’, 2’) as those of the corresponding 
image point. 

The transformations which are not only the simplest and most easily 
understood, but are also of fundamental importance for the general case, 
are the affine transformations. An affine transformation is one in which 
the co-ordinates (x’, y’, 2’) (or in the plane (2’, y’)) of the image point are 
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expressed linearly in terms of those of the original point. Such a trans- 
formation is therefore given by the three equations 


a = ax + by + cz-+m 
yY=dz+eytfetn 
z= ge-+ hy + ke+p, 


or in the plane by the two equations 


af = an + by +m 
y = cx + dy+n, 


with constant coefficients a, b,... These assign an image point to every 
point of space (or of the plane). The question at once arises whether we 
can interchange the relation of image point and original point, that is, 
whether every point of space (or of the plane) has an original point corre- 
sponding to it. The necessary and sufficient condition for this is that the 
equations 


axz+ by+cze=2’—m 
de+ey+fe=y'—n or 
gt + hy t+ ke=2 —p 


shall be capable of being solved for the unknowns 2, y, 2 (or 2, y), no 
matter what the values of 2’, y’, 2 are. By section 3 (p. 24) an affine 
transformation has an inverse, and in fact a unique inverse,* provided 
that its determinant 


ax-+ by = a’ —m 
ca+dy=y—n 


abe a 
A=l/de f!, or A= : ib 
ghk 


is different from zero. We shall confine our attention to affine trans- 
formations of this type, and shall not discuss what happens when 
A= 0. 

By introducing an intermediate point (x”, ’, 2’) we can resolve the 
general affine transformation into the transformations 


w’=ar+tby+ea x’ = ax + by 


y’=deteyt+fe or oy d 
2” = gu + hy + ke See 


and 
ava’ tm — alt 
eee are 
f=2’ tp ba 


Here (x, y, 2) is mapped first on (x, y”, 2”) and then (x”, y’”’, 2’) is mapped 
on (2’, y’, 2’). Since the second transformation is merely a parallel translation 
of the space (or of the plane) as a whole and is therefore quite easily under- 


* That is, every image point has one and only one original point. 
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stood, we may restrict ourselves to the study of the first. We shall there- 
fore only consider affine transformations of the form 


x = ax + by + cz 
y = dx + ey+ fe or 
Z=gethyt+k 


with non-vanishing determinants. 
The results of section 3 (p. 25) for linear equations enable us to express 
the inverse transformation by the formule 
= a'e + by + cz z= a's! + by’ 
y= ae +ey+fz or ym es! dy, 
emg’ + hi +k? 4 


a! = aa + by 
y’ = ce + dy 


in which a’, b’, . . . are certain expressions formed from the coefficients 
a,b, . . . Because of the uniqueness of the solution, the original equations 


also follow from these latter. In particular, from x = y = z = 0 it follows 
that 2’ = y’ = 2’ = 0, and conversely. 

The characteristic geometrical properties of affine transformations are 
stated in the following theorems. 

(1) In space the image of a plane is a plane; and in the plane the image 
of a straight line is a straight line. 

For by section 1 (p. 9) we can write the equation of the plane (or the 
line) in the form 

Az+ By+Cz+D=0 


(or Az + By + D= 0). 


The numbers A, B, C (or A, B) are not all zero. The co-ordinates of the 
image points of the plane (or of the line) satisfy the equation 
A(a’a! + b’y’ + c'2') + Bd’a’ + ey’ + f’2’) 
+ Ope’ + h’y + Wz’) + D=90 


(or = A(a’a’ + Oy’) + Ble’x’ + dy’) + D= 0). 


Hence the image points themselves lie on a plane (or a line), for the co- 
efficients 
A’=adA+dB4+ 90 F 
A’=aA 
O=CATSf/B+kKC 
of the co-ordinates 2’, y’, 2’ (or 2’, y’) cannot all be zero; otherwise the 
equations 


WA+dB+9/C=0 igi 
, UG i — ie) 
v4+e°RB+hH0=0 (or vVA+dB=0. 
CA+f/B+kC=0 


would hold, and these we may regard as equations in the unknowns A, B, 0 
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(or A, B). But we have shown above that from these equations it follows 
that d= B=C=0 (or A= B=0). 

(2) The image of a straight line in space ts a straight line. 

This follows immediately from the fact that a straight line may be 
regarded as the intersection of two planes; by (1) its image is also the inter- 
section of two planes and is therefore a straight line. 

(3) The images of two parallel planes of space (or of two parallel lines of 
the plane) are parallel. 

For if the images had points of intersection the originals would have 
to intersect in the original points of these intersections. 

(4) The images of two parallel lines in space are two parallel lines. 

For as the two lines lie in a plane and do not intersect one another, 
the same is true for their images, by (1) and (2). The images are therefore 
parallel. 

The image of a vector v is of course a vector v’ leading from the image 
of the initial point of vw to the image of the final point of v. Since the 
components of the vector are the differences of the corresponding co- 
ordinates of the initial and final points, under the most general affine 
transformation they are transformed according to the equations 


vf = av, + bv, + cv, 
vf = dv, + ev, + furs 
v5, = gv, + hv, + kg, 


2. The Combination of Affine Transformations and the Resolution 
of the General Affine Transformation. 
If we map a point (x, y, z) on an image point (2’, y’, 2’) by means of the 
transformation 
a’ = ax + by + cz 
y = dz+ ey + fz 
2 = get hy + ke 


a 


and then map (2’, y’, z’) on a point (2, y”, 2’) by means of a second affine 
transformation 

av” = aya’ + by’ + ¢,2' 

yf = da! + ey’ + fiz 

2! == gua! + Iny! + kz’, 


then we readily see that (x, y, z) and (x, ¥”’, 2’) are also related by an affine 
transformation: and in fact 


xv’ = age + dey + Cz 
y= dex + egy + fez 
2’ = gor t+ hey + kz 


where the coefficients are given by the equations 
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a,=aa+b,d+ 9, b,= a,b + be + yh, Cy aye + bf + yk, 
d=dated+fg, g=db+eetfh, fp=adet af + fk, 
92 = Git + hyd + kyg, he=gb+ be+ bh, = ge+ hft kk. 


We say that this last transformation is the combination or resultant of the 
first two. If the determinants of the first two transformations are different 
from zero, their inverses can be formed; hence the compound transforma- 
tion also has an inverse. The coefficients of the compound transformation 
are obtained from those of the original transformation by multiplying 
corresponding elements of a column of the first transformation and of a 
row of the second, adding the three products thus obtained, and using this 
“ product” of column and row as the coefficient which stands in the 
column with the same number as the column used and in the row with 
the same number as the row used. 
In the same way, combination of the transformations 


a’ = aw + by anal xv” = an’ + dy’ 
y' = ox + dy y= qe! + dy! 


gives the new transformation 


a” = (aya -+ bye)e + (ab + Bydly 
y’” = (a + dye)e + (qb + dydyy. 


By a primitive transformation we mean one in which two (or one) 
of the three (or two) co-ordinates of the image are the same as the corre- 
sponding co-ordinates of the original points. Physically we may think of 
@ primitive transformation as one in which the space (or plane) undergoes 
a stretching in one direction only (the stretching of course varying from 
place to place) so that all the points are simply moved along a family of 
parallel lines. A primitive affine transformation in which the motion takes 
place parallel to the z-axis is analytically represented by formulz of the 


type 


pines ae a’ = ax + by 
Se ° yay 
=e 
The general affine transformation in the plane, 
x’ = ax + by 
y == 0@ dy, 


with a non-vanishing determinant, can be obtained by a combination of 
primitive transformations. 
In the proof we may assume * that a + 0. We introduce an intermediate 


*Ifa = 0, then b + 0, and we can return to the case a + 0 by interchanging 
x and y. Such an interchange, represented by the transformation X=y, 
Y =a, is itself effected by the three successive primitive transformations 


&=e-y, &=-& . X= -htmay 
m= > mat atmn2e’ Y= wy ™ &. 
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point (& ») by the primitive transformation 
—=ax+ by, 1=Y 


whose determinant a is different from zero. From &, 7 we obtain 2’, y’ 
by a second primitive transformation 


a’ = &, et ey ae be 
a a 
with the determinant 
ad—be Ala | 
a ale al 


This gives the required resolution into primitive transformations, 
In a similar way the affine transformation in space 


a = ax + by + cz 
y = da + ey + fa 
v= gx + hy + kz, 
with a non-vanishing determinant, can be resolved into primitive transforma- 
tions. 
Of the three determinants 


a b 
de 


ae be 


"Id fl le f 


at least one must be different from zero; otherwise, as the expansion in 
terms of the elements of the last row shows, we should have 


abe 

de fi=0. 

gh k 
As in the previous case, we can then assume without loss of generality 
(1) that | | + 0, and (2) that a+ 0. The first intermediate point 


e 
(E, n, ¢) is given by means of the equations 


& = ax + by + cz 
q= y 
t= Ze 


The determinant of this primitive transformation is a, which is not zero. 
For the second transformation to &’, 1’, ¢’ we wish to put & = & ¢’ = %, 
and also to have vf =y’. One primitive transformation then remains. If 
in the equation 7 = y/ = dx + ey + fz we introduce the quantities & y, ¢ 
instead of a, y, 2, we obtain the second primitive transformation in the 
form 
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=e 
,_4@ lla b lla ec 
Me a el gla: go 
v= 
1 b ; 
The determinant of this transformation is - ‘3 I+ 0. The third trans 
é ald e 
formation must then be 
a’ = & 
y= 
a b ce} 
| e | “| de | 
h h hk 
“=~ g ra " g a 
a b bas a b at ao S 
de de de 


3. The Geometrical Meaning of the Determinant of Transforma- 
tion, and the Multiplication Theorem. 


From the considerations of the previous section we can find the 
geometrical meaning of the determinant of an affine transformation and 
at the same time an algebraic theorem on the multiplication of 
determinants. 

We consider a plane triangle with vertices (0, 0), (a1 Yr), (%2» Y2)» Whose 
area is given (section 2, p. 14) by the formula 


1 


=2 


% % 
4 Ye 
We shall investigate the relation between A and the area A’ of its 
image obtained by means of a primitive affine transformation 

a’ = ax-+ by 

y= y. 
The vertices of the image triangle have the co-ordinates (0, 0), 
(ax, + by;, Yi), (aa, + bye, Yp), and therefore 


aha =n ait, + by, 
2 "1 Ye 


xy! taf 
‘ ‘, 
% Ye 


2 


This determinant, however, can be transformed by the theorems of section 3 
(p. 22) in the following way: 


, 1 | az, + by, aa, + by, l{ax, ax, ala, a, 
A == 15 se : 
2 1 Yo 2) ti Ya 21% Ys 
that is, 
A’ = aA, 


3 (8912) 
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If we had taken the primitive transformation 


v=% 
y’ = ca + dy, 


we should have found in the same way that 
A’= dA. 


We see, therefore, that a primitive affine transformation has the effect 
of multiplying the area of a triangle by a constant independent of the tri- 
angle.* Since the general affine transformation can be formed by com- 
bining primitive transformations, the statement remains true for any 
affine transformation. In the case of an affine transformation the ratio of 
the area of an image triangle to the area of the original triangle is constant 
and independent of the choice of triangle, depending only on the coefficients of 
the transformation. In order to find this constant ratio we consider in 
particular the triangle with vertices (0, 0), (1, 0) and (0, 1), whose area A 
is 4. Since the image of this triangle according to the transformation 


a = ax + by 

y = cx + dy 
has the vertices (0, 0), (a, ¢), (0, d) its area ia 
1 ao 
2 c al’ 


ab 
=A 
€ | 


and we thus see that the constant ratio of area A’/A for an affine trans- 
formation is the determinant of the transformation. 

For transformations in space we can proceed in exactly the same way. 
If we consider the tetrahedron with the vertices (0, 0, 0), (x, 41, %), 
(es Yor 22)s (%gs Ys: 23) and subject it to the primitive transformation 


a’ = ax + by + cz 

y= y 

“= 2, 
the image tetrahedron has the vertices (0, 0, 0), (aa, + by, + cz Yq 2), 
(aig + by. + C&ay Yas 22), (ag + bys + C23, Ys, 23), 80 that its volume V’ is 


ax, + by, + cz = ty + dyn + C2 = arg + by + cry 


1 
Vv= 8 " Ye Ys 
% Ze 2s 
a| % Uy Lg 
= 6| 4% Ye Ys'i- 
2 2% es] 


* If no vertex of the triangle lies at the origin, the same fact holds, in virtue 
of the general formula for the area given on p. 14. 
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Hence V’ = av, 


where V is the volume of the original tetrahedron. For the volume of the 
image given by the primitive transformation 


a = 2% 
y = dx + ey + fe 
aw 
we similarly find that 
W=eDd, 
and for the primitive transformation 
w= 2% 
y=y 


Z=gethy+k 
we find that 
W=kv 


From this it follows that an arbitrary affine transformation has the effect 
of multiplying the volume of a tetrahedron by a constant.* In order to 
find this constant for the transformation 


x’ = ax + by + cz 
y’ = dx + ey + fe 
2 = ga + hy + kz 


we consider the tetrahedron with the vertices (0, 0, 0), (1, 0, 0), 
(0, 1, 0), (0, 0, 1), whose image has the vertices (0, 0, 0), (a, d, g), 
(b, e, h), (c, f, k). For the volumes V’ and V of the image and the 
original we therefore have 


1/2 be l 
Vise de fi, V= 63 
gh kl 
abe 
hence the determinant |d e f| is the constant sought. 
g hk 


The sign of the determinant also has a geometrical meaning. For from 
what we have seen in section 2 (p. 18) on the connexion between the sense 
of rotation and the volume of the tetrahedron or area of the triangle, it 
follows at once that a transformation with a positive determinant preserves 
the sense of rotation, while a transformation with a negative determinant 
reverses tt. 


*If no vertex of the tetrahedron coincides with the origin, this theoren 
follows from the general formula for the volume of a tetrahedron (p. 18). 
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We now consider the combination of two transformations 


a = ax + by + cz at! = aya! + by’ + cz" 
yf = dx+ ey + fz y= da! + ay + fit 
f= gat hy + ke 2! = gyal + yy’ + ky’, 


of” == (aya + bd + cyg)e + (ab + Bye + Gh)y + (aye + bf + ck)z 
yo = (da + ed + fig)e + (db + ee + fibly + (dye + af + fik)e 
a” = (ga + hyd+ kyg)e + (gb + he + kyhjy + (ge + in ft keyk)2. 


As we pass from 2, y, 2 to 2’, y’, 2’ the volume of a tetrahedron is multiplied 
by 


abe 
defi, 
hk 


ft att 


as we pass from a’, y’, 2’ to 2”, y”, 2” by 


91 fy 
and by direct change from 2, y, 2 to 2”, y’, 2” it is multiplied by 
aa+bd+eag ab+bet oh ae+ f+ ok 
datedt+fg dbteaetfh de+ef+t fik 
gnethdtig gbt+thet+ hh ge+thf+ kk 


' This gives us the following relation, known as the theorem for the 
multiplication of determinants: 


a bb Gy Gd, by Cy 
dag fi d, @ fy 
ni yy hy Jo fig Ibe 


yy + Bydy+ C192 Gybg + Byeg + Cyhe “Ayla + by fy + chy 
yy + Ody + fgg dyby + eee + fie die + Cr fa + ike 
Gag + Fiydy + Fey, Grbe + hyeg + Myke Giea + fife + hyhe 


As before, we call the elements of the determinant on the right the “ pro- 


aq by Gy dy by Cy 
ducts” of the rows of |d, ¢, f,| and the columns of |d, ¢, f,|3 at 
$n yy hy 92, hy ke 


the intersection of the i-th row and the k-th column of the product of 
the determinants there stands the expression formed from the i-th row of 


a b Gy A, by Cy 
d, & f,| and the &th column of |d, e f,|. Since rows and 
1 hy hy G2 hy ke, 


columns are interchangeable, the product of the determinants can also 
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be obtained by combining columns and rows, columns and columns, or 
rows and rows. 

For two-rowed determinants the corresponding theorem of course 
holds, namely 
yA + bib, ay0q + Bide 
€0q + db, 040, + dds 


a, by 
Cp dy 


a by 
4 
(combining rows and rows, &c.). 


ExsaMrpLis 


1. Evaluate the following determinants: 


3 4 5! 111 111 1 x 2 
(a) 4 6 6), (11 2 4), (]2 3 4, Milly yx. 
5 6 7 139 3-17 lz 2 


2. Find the relation which must exist between a, b, c in order that the 
system of equations 
3a + 4y 4+ bz 4 
4x + 5y + 6 = 5b 
5a + by + 7z=c¢ 
may have a solution. 


3*. (a) Prove the inequality 


aéoe 
D=|a WU | SV/(@+e + c) (a* + 0 + c) (a + 02 + C7), 
a”’ b” c”’ 


(6) When does the equality sign hold? 
4. What conditions must be satisfied in order that the affine trans- 
formation 
a= ax-+- by, y’ = cx -+ dy 
may leave the distance between any two points unchanged? 
5. Prove that in an affine transformation the image of a quadric 
ax? + by? + cz% 4 day + exz + fyz + gut hy +t+y7=0 
is another quadric. 
6*. Prove that the affine transformation 
a’ = an-+ by + cz 
y’ = du + ey + fz 
2 = gx+ hy + kz 


leaves at least one direction unaltered. 
7. Give the formule for a rotation through the angle » about the axis 
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z:y:z2=1:0:--1 such that the rotation of the plane 2 = z is positive 
when looked at from the point (—1, 0, 1). 


8. Prove that an affine transformation transtorms the centre of mass 
of a system of particles into the centre of mass of the image particles. 


9. If a1,..+, Ys denote the quantities on p. 6, defining a rotation of 
axes, then 
a Br ¥ 
% Be Yaf= +1. 
% Bs Ys 


CHAPTER II 


Functions of Several Variables and 
their Derivatives 


We have already become acquainted with functions of several 
variables in Chapter X of Vol. I, and there learned enough to 
appreciate their importance and usefulness. We are now about 
to enter on a more thorough study of these functions, discussing 
properties which were not touched upon in the previous volume 
and proving theorems which there were merely made plausible. 
No proof in this volume will involve previous knowledge of any 
proof developed in Chapter X of Vol. I. Yet the student is 
recommended to read that chapter, as the intuitive discussion 
given there will assist him in forming mental images of matters 
which are perhaps somewhat abstract. 

As a rule a theorem which can be proved for functions of two 
variables can be extended to functions of more than two variables 
without any essential change in the argument. In what follows, 
therefore, we shal] usually confine ourselves to functions of two. 
variables, and shall only discuss functions of three or more 
variables when some special point is involved. 


1. Tae Concert or FuNcTION IN THE CASE OF 
SEVERAL VARIABLES 
1. Functions and their Ranges of Definition. 
Equations of the form 
u=a2ty, u= xy, or u= log(l — 2*— y*) 
assign a functional value u to a pair of values (z, y). In the 


first two of these examples a value of u is assigned to every pair 


of values (x, y), while in the third the correspondence has a 
39 
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meaning only for those pairs of values (x, y) for which the 
inequality 2? + y? < 1 is true. 

Tn these cases we say that u is a function of the independent 
variables x and y. This expression we use in general whenever 
some law assigns a value of u as dependent variable, corresponding to 
each pair of values (x, y) belonging to a certain specified set, Simi- 
larly, we say that w is a function of the n variables a, %, ..., Zn 
if for every set of values (a, %, ..., %) belonging to a certain 
specified set there exists a corresponding value of u. 

Thus, for example, the volume u = xyz of a rectangular parallelepipec 
is a function of the lengths of the three sides x, y, z; the magnetic de- 


clination is a function of the latitude, the longitude, and the time; the 
sum 2%, + %+... +2, is a function of the » terms 2, ty... , Ty 


In the case of functions of two variables we represent the pair 
of values (x, y) by a point in a two-dimensional rectangular co- 
ordinate system with the co-ordinates x and y, and we occasionally 
call this point the argument point of the function. In the case of 
the functions u= «+ y and u= 2?y? this argument point can 
range over the whole of the zy-plane, and we say that these 
functions are defined in the whole zy-plane. In the case of the 
function u = log(1 — 2? — y?), the point must remain within the 
circle 2? + y® <1, and the function is defined only for points 
inside this circle. 


y 
x Oo “x 
Fig. t.—A simply-connected region Fig. 2.—A triply-connected region 


As in the case of functions of a single variable, the arguments 
in the case of functions of several variables may be either “ dis- 
continuous ” or “ continuous”, Thus the average population per 
state of the United States depends on the number of states and 
on the number of inhabitants, both of which are integers. On 
the other hand, lengths, weights, &c., are examples of continuous 
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variables. In the future we shall deal almost exclusively with 
pairs of continuously variable arguments; the point (x, y) will 
be allowed to vary in a definite “region” (or “domain ”) of 
the ay-plane, corresponding to the “interval” in the case of 
functions of one variable. This region may consist of the whole 
xy-plane; or it may consist of a portion of the plane bounded by 
a single closed curve C which does not intersect itself (a ‘“ simply- 
connected region ”; cf. fig. 1); or it may be bounded by several 
closed curves. In the last case it is said to be a “ multiply- 
connected region ”’, the number of the boundary curves giving 
the so-called “connectivity”; fig. 2, for example, shows a 
triply-connected region. 


ie) 


+—- & —»} 


Fig. 3.—A rectangular region Fig. 4.—A circular region 


The boundary curves, and in fact every curve considered in 
this volume, will be assumed to be sectionally smooth.* That is, 
we assume once and for all that every such curve consists of a 
finite number of arcs, each one of which has a continuously- 
turning tangent at each of its points up to and including the end 
points. Such curves, therefore, can at most have a finite number 
of corners or cusps. 

The most important types of region, which recur over and 
over again in the study of functions of several variables, are (1) 
the rectangular region (fig. 3), defined by inequalities of the form 


as2rsb 
eSysd, 


in which each of the independent variables is restricted to a 
definite interval, and the argument point varies in a rectangle; 


* Ger. stiickweise glatt. 
3° (B912) 
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and (2) the circular region (fig. 4), defined by an inequality of the 
form 
(— a)? + y— BPS, 


in which the argument point varies in a circle with radius r 
and centre (a, 8). 

A point P which belongs to a region R is said to be an iniertor 
point of R if we can find a circle with its centre at P lying entirely 
within R. If P is an interior point of R, we also say that R is 
a neighbourhood of P. Thus any neighbourhood of P will contain 
a sufficiently small circle with P as centre. 

We may briefly remark that corresponding statements hold 
in the case of functions of more than two independent variables, 
e.g. of three variables x, y, z. In this case the argument point 
varies in a three-dimensional region instead of in a plane region. 
In particular, this region may be a rectangular region, defined 
by inequalities of the form 


asaSb, cSySd, eSzsf, 
or a spherical region, defined by an inequality of the form 
(2— a+ y— BP +e—yP Sr 


In conclusion, we shall mention a finer distinction, which, while scarcely 
essential for the purposes of this book, is nevertheless of importance in 
more advanced study. We sometimes have to consider regions which do 
not contain their boundary points, that is, the points of the curves bound- 
ing them. Such regions are called open regions (cf. the Appendix to this 
chapter, p. 98). Thus, for example, the region x* + y* < 1 is bounded by 
the circle 22 -+ y2 = 1, which does not belong to the region; the region 
is therefore open. If, on the other hand, the boundary points do belong 
to the region, as will be the case in most of the examples which we shall 
discuss, we say that the region is closed, 


When we are dealing with more than three independent 
variables, say 2, y, 2, w, our intuition fails to provide a geometrical 
interpretation of the set of independent variables. Still, we 
shall occasionally make use of geometrical terminology, speaking 
of a system of n numbers as a point in n-dimensional space. 
By rectangular regions and spherical regions in such a space we 
naturally mean systems of points whose co-ordinates satisfy 
inequalities of the form 


% StS, bSySh, 6 S2S%, d,Susd,, Sats 
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or (e—a)?+ (y— BP+e—yP+w—or+...s7 


respectively. 

We can now give precise expression to our definition of the 
concept of function in the following words. If R is a region in 
which the independent variables x, y,... may vary, and tf a definite 
value wu is assigned to each point (x, y, .. .) of this region according 
to some law, then u = £(x, y, ...) is said to be a function of the 
continuous independent variables oh 

It is to be noted that, just as in the case of functions of one 
variable, a functional correspondence associates a unique value of 
u with the system of independent variables 2, y,.... Thus if the 
functional value is assigned by an analytical expression which 


is multiple-valued, such as arc tan v, this expression does not 


determine the function completely. On the contrary, we have still 
to specify which of the several possible values of the expression 
is to be used; in the case mentioned, we have still to state that we 
are to take the value of arc tan which lies between = and 
+ 5 or the value between 0 and 7, or we must make some other 


similar specification. In such a case we say that the expression 
defines several different single-valued branches of the function 
(ef. Vol. I, p. 17). If we wish to consider all these branches at 
once, without giving any one of them preference, we may regard 
them as together forming a multiple-valued function. In this 
book, however, we shall make use of this idea in Chap. VIII only. 


2. The Simplest Types of Functions. 


Just as in the case of functions of one variable, the simplest functions 
are the rational integral functions or polynomials. The most general 
polynomial of the first degree (linear function) is of the form 


u=ax+ by+e, 
where a, 6, and ¢ are constants. The gencral polynomial of the second 
degree has the form 
u = ax? + bry + cy® + det eyt f. 


The general polynomial of any degree is a sum of terms of the forma,,,,x”y", 
where the constants a,,,, are arbitrary. 
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Rational fractional functions are quotients of polynomials; to this class 
belongs e.g. the linear fractional function 
oat ds a 
az + by +c 


By extraction of roots we pass from the rational functions to certain 


algebraic functions,* e.g. 
= 254 = 3 H(a + y)? (@tyP 
e+ ay 


In the construction of more complicated functions of several variables 
we almost always fall back on the well-known functions of one variable,t 


e.g. 


u=sin(zarccosy) or u= log,y. 


3. Geometrical Representation of Functions. 

In Chapter X of Vol. I we discussed the two principal methods for 
representing a function of two independent variables, namely (1) by 
means of the surface u = f(x, y) in zyu-space, described by the point with 
co-ordinates (z, y, u) as (z, y) ranges over the region of definition of the 
function u, and (2) by means of the curves (contour lines) in the zy-plane 
along which w has a definite fixed value k. We shall not repeat this dis- 
cussion here. If the student is not already perfectly familiar with these 
methods of geometrical representation, he would be well advised to turn 
to the previous volume and read the discussion given there (p. 460 et seq.). 


2. CoNnTINUITY 
1. Definition. 


The reader who is acquainted with the theory of functions of 
a single variable and has seen what an important part is played 
in it by the concept of continuity will naturally expect that a 
corresponding concept will figure prominently in the theory of 
functions of more than one variable. Moreover, he will know in 
advance that the statement that the function u= (f(z, y) is 
continuous at the point (x, y) will mean, roughly speaking, that 
for all points (€, n) near (x, y) the value of the function £(&, ) 
will differ but little from {(x, y). This idea we shall express more 
precisely as follows. 

The function f(x, y), defined in the region R, is continuous at 
the point (€, n) of R, provided that for every positive number « it is 
possible to find a positive number 5 = 5(«) (in general depending on 


* For an accurate definition of the term “ algebraic function ” see p. 119. 
¢ Cf. also the section on compound functions (p. 69). 
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e and tending to 0 with «) such that for all points* of the region 
whose distance from (£, 1) is less than 8 (that is, for which 


@— éP+y—7P?s%), 
| f(z, y) —f(é n) | Ss € 


Or, in other words, the relation 


is to hold for all pairs of values (h, k) such that h? + i? < 8? and 
the point (€ + h, y + &) belongs to the region R. 

If a function is continuous at every point of a region R, we 
say that it is continuous in R. 

In the definition of continuity we can replace the distance 
condition h? + k? <= 8 by the following equivalent condition: 

To every «>0 there shall correspond two positive numbers 
5, and 5, such that 


[fE+4, 0+) —f(E 0) | Se 


whenever | h| < 8, and | k| S 3,. 

The two conditions are equivalent. For if the original con- 
dition is fulfilled, so is the second 
if we take 5, = 6, = 8/+/2; and 
conversely, if the second con- 
dition is fulfilled, so is the first 
if for 5 we take the smaller of the 
two numbers 5, and 6. 

The following facts are almost 
obvious: 

The sum, difference, and pro- 
duct of continuous functions are ° x 
also continuous. The quotient of Fig. 5—Boundary point 
continuous functions is continuous 
except where the denominator vanishes. Continuous functions of 
continuous functions are themselves continuous (cf. section 5, No. 1, 
p. 70). In particular, all polynomials are continuous, and all 
rational fractional functions are also continuous except where the 
denominator vanishes.} 


* Fig. 5 illustrates the case where (£, 7) lies on the boundary of R. 
+ Another obvious fact, which, however, is worth stating, is as follows: 
if a function f(x, y) t8 continuous in @ region R and is different from zero at an 
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A function of several variables may have discontinuities of a much 
more complicated type than a function of a single variable. For example, 
discontinuities may occur along whole arcs of curves, as in the case of the 
function w= y/z, which is discontinuous along the whole line z= 0. 
Moreover, a function f(x, y) may be continuous in x for each fixed value 
of y and continuous in y for each fixed value of x and yet be a discontinuous 

ay 

Jf(0, 0) =0. If we take any fixed non-zero value of y, this function 
is obviously continuous as a function of 2, as the denominator cannot 
vanish. If y= 0, we have f(z, 0)= 0, which is a continuous func- 
tion of x. Similarly, f(z, y) is continuous in y for each fixed value of 
x. But at every point on the line y = 2, except the point x= y= 0, 
we have f(x, y) =1; and there are points of this line arbitrarily close 
to the origin. Hence the function is discontinuous at the point 
a=y=0. 

Other examples of discontinuous functions will be found in Vol. I 
(p. 464). 


function of z and y. This is exemplified by the function f(z, y) = 


2. The Concept of Limit in the Case of Several Variables. 

The concept of the limit of a function of two variables is 
closely related to the concept of continuity. Let us suppose that 
the function f(z, y) is defined in a region R, and that (&, 7) is a 
point either within # or on its boundary. Then the statement 
that the limit of f(z, y) as x tends to & and y tends to 7 is J is 
to be understood as having the following meaning: for every 
 €>0 there isa 8> 0 such that 

fl y)—l]<e 
for all points (x, y) in R for which the inequality 
0<(@— P+ y—7nP sk 
holds. It is to be noted that, just as in the case of functions of 
one variable, the point (x, y) is required to be distinct from the 
point (£, 7). 
We symbolize the existence of the limit J by writing 

len £2: y) = l, or f(a, y) > 1 as (a, y) > (€, 9). 

yn 
interior point P of the region, it is possible to mark off about P a neighbourhood, 
aay a circle, belonging entirely to R, in which {(x, y) 18 nowhere equal to zero. 
For if the value of the function at P is a, we can mark off about P a circle so 


small that the value of the function in the circle differs from a by less than 
a/2 and therefore is certainly not zero. 
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For emphasis this is sometimes read “the double limit as z 
tends to é and y tends to 7 of f(x, y) isl”. 

Using the language of limits, we can say that a function 
f(2, y) is continuous at a point (£, ) if, and only if, 


ser f(y) = f(é, 2). 


5 ial 


We can see the matter in a new light if we consider sequences 
of points. We shall say that a sequence of points (21, y1), (22, Ye), 
++) (2ns Yn), --- tends to a limit point (£,:y) if the distance 
V/{(%a— 2+ (Yn— 7)*} tends to 0 as m increases. We can then 
show at once (cf. Vol. I, p. 47) that if f(x, y) > Las (a, y) > (€, 7), 
then lim f(a, Yn) = 1 for every sequence of points (%, Y,) in R 


n—> 
which tends to (£, ). The converse is also true; if lim f(@,, y,) 
n—> 0 


exists and is equal tol for every sequence (2, yn) of points in R 
tending to (¢, 7), then the double limit of f(z, y) as « > and 
y > 7) exists and is equal tol. We omit the proof of this. 

In our definition of limit we have allowed the point (x, y) to 
vary in the region R. If we so desire, however, we can impose 
restrictions on the point (x, y). For example, we may require it 
to lie in a sub-region R’ of R, or on a curve C, or in a set of points 
Min R. In this case we say that f(z, y) tends to ! as (a, y) tends 
to (é, 7) in R’ (or on C, or in M). It is of course understood that 
R' (or C, or M) must contain points arbitrarily close to (£, 7) in 
order that the definition may be applicable. 

Our definition of continuity then implies the two following 
requirements: (1) as (x, y) tends to (€, 4) in R the function 
f(x, y) must possess a limit 1; and (2) this limit 1 must coincide 
with the value of the function at the point (€, 7). 

It is obvious that we could define continuity of a function, 
not only in a region R but also, for example, along a curve C, 
in the same way. 


3. The Order to which a Function Vanishes.* 


If the function f(z, y) is continuous at the point (£, ), the 
difference f(a, y) —f(€, 7) tends to zero as x tends to € and y 
tends to 7. By introducing the new variables h = «— ¢ and 


* This sub-section may be omitted on a first reading. 
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k = y — yn we can express this as follows: the function ¢(h, k) 
=f(é+h, 7 +h) —f(é 7) of the variables h and & tends to 0 
as h and k tend to 0. 

We shall frequently meet with functions such as ¢(h, /) 
which tend to zero * as # and & do. As in the case of one in- 
dependent variable, for many purposes it is useful to describe 
the behaviour of ¢(h, k) as h-~0O and k-0 more precisely 
by distinguishing between different “orders of vanishing” or 
“orders of magnitude” of ¢(h, k). For this purpose we base 
our comparisons on the distance 


p= Vit B= Ve f+ y— a) 


of the point with co-ordinates x= €-+ A and y=7-+ & from 
the point with co-ordinates € and 7, and make the following 
statement: 

A function ¢(h, k) vanishes as p—>0 to the same order as 
p= Vh?-+ k’, or, more precisely, to at least the same order as p, 
provided that there is a constant C, independent of h and k, 
such that the inequality 


eel so 
; 


holds for all sufficiently small values of p; or, more precisely, when 
there is a 8 > 0 such that the inequality holds for all values 


of h and & such that 0< VA? + k?< 8. Further, we say that 
4(h, k) vanishes to a higher order + than p if the quotient £ *) 
p 


tends to0 as p->0. This is sometimes expressed by the sym- 
bolical notation ¢ d(h, &) = o(p). 


*In the older literature the expressions “ ¢(h, &) becomes infinitely small 
as h and k do” or “ d(h, k) is infinitesimal ” are also found. These statements 
have a perfectly definite meaning if we regard them simply as another way of 
saying “ ¢(h, k) tends to 0 as h and k do”. We nevertheless prefer to avoid 
the misleading expression “ infinitely small ” entirely. 

+ In order to avoid confusion, we would expressly point out that a higher 
order of vanishing for p—> 0 implies smaller values in the neighbourhood of 
p=0; for example, p* vanishes to a higher order than p, and p* is smaller 
than p, when p is nearly zero. 

$ The letter o is of course chosen because it is the first letter of the word 
order. If we wish to express the statement that ¢(h, k) vanishes to at least the 
same order as p, but not necessarily to a higher order, we use the letter O instead 
of o, writing ¢(h, k) = O(p). In this book, however, we shall not use this 
symbol. 
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Let us now consider a few examples. Since 


| h| | | 
Tie S18 THF” 


the components h and & of the distance p in the directions of the z- and 
y-axes vanish to at least the same order as the distance itself. The same 
is true for a linear homogeneous function ah+ bk with constants a and 6, 


or for the function e ee For fixed values of « greater than 1 the power 
p* of the distance vanishes to a higher order than 9; symbolically, 
e*=0(p) for a > 1. Similarly, a homogeneous quadratic polynomial 
ah? + bhk -- ck? in the variables h and & vanishes to a higher order than 


pas p>0: 
ah? +- bhk + ck* = o(¢). 


More generally, the following definition is used. If the 
comparison function w(h, k) is defined for all non-zero values 
of (4, &) in a sufficiently small circle about the origin, and 
is not equal to zero, then ¢(h, k) vanishes to at least the same 
order as w(h, k) as p> 0 if for some suitably chosen constant Cc 
the relation 

$(b, ) 
w(h, k) s0 


holds; and similarly, 6(h, k) vanishes to a higher order than o(h, k), 


or $(h, k) = o(w(h, h)), if a 


+0 when p— 0. 


For example, the homogeneous polynomial ah? + bhk +- ck? is at least 
of the same order as p%, since 


| ah? + bhe + cl®| S(lal +4 O] + lel) G+. 


1 
Also p = o( —— ). since lim (p loge) = 0 (Vol. I, p. 195). 
: | log e | p—>0 


EXAaMPLEs 


1. Discuss the behaviour of the functions 


(a) a — say’, 
(3) a4 — Baty? + of 
in the neighbourhood of the origin. 


2. How many constants does the general form of a polynomial P(x, y) 
of degree n contain? 
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3. Prove that the expression 
ax? + bay? + cx%y + dys 
vanishes at z= y= 0 to at least the same order as p? = (x? -++ y*)}, 
4. Find the condition that the polynomial 
P = aa* + 2bay + cy* 
is of exactly the same order as p? in the neighbourhood of z= 0, y= 0 
(i.e. both a and e are bounded). 


5. Are the following functions continuous at # = y = 0? 


coer a + 2ay + y* wv + Sy + ¥? 
Osrye O-are Osteyty 
= |e—y| -ao, 
= ay ae (e) e° 2—Rzys¥8, P 
Erol 2 
(f) | |. (g) [ative aye fy fier VOT. 


viet a+]? 


6. Find a 8(e) (p. 44) for those functions of Ex. 5 which are continuous, 


3. THE DERIVATIVES OF A FUNCTION 


1. Definition. Geometrical Representation. 


If in a function of several variables we assign definite numeri- 
cal values to all but one of the variables and allow only that one 


Cole 
nays 
rs 


Fig. 7 

Sections of u = f(x, y) 
variable, say x, to vary, the function becomes a function of one 
variable. We consider a function u = f(x, y) of the two variables 
g and y and assign to y a definite fixed value y= y,=c. The 
function w= f(x, Yo) of the single variable x which is thus 
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formed may be represented geometrically by letting the plane 
y = Yo cut the surface u= f(x, y) (cf. figs. 6 and 7). The 
curve of intersection thus formed in the plane is represented by 
the equation u= f(x, yo). If we differentiate this function in 
the usual way at the point « = 2, (we assume that the derivative 
exists), we obtain the partial derivative of f(x, y) with respect to x 
at the point (x, yo). According to the usual definition of the 
derivative, this is the limit * ) 


lim f(t +h, yo) — fo Yo) 
h—>0 h 

Geometrically this partial derivative denotes the tangent of the angle 
between a parallel to the z-axis and the tangent line to the curve 
u = f(x, yo). It is therefore the slope of the surface u = f(x, y) tn the direc- 
tion of the x-axis. 


To represent these partial derivatives several different nota- 
tions are used, of which we mention the following: 


in f(%o + 4, Yo) — F(@o Yo) 
h 


h—>0 


= fal®o, Yo) = UalZos Yo): 


If we wish to emphasize that the partial derivative is the limit 
of a difference quotient, we denote it by 


of a 

az” oa" 
Here we use a special round letter 0, instead of the ordinary d@ 
used in the differentiation of functions of one variable, in order 
to show that we are dealing with a function of several variables 
and differentiating with respect to one of them. 

It is sometimes convenient to use Cauchy’s symbol D, men- 

tioned on p. 90 of Vol. I, and write 


of _ 


Cc as? 


but we shall seldom use this symbol. 
In exactly the same way we define the partial derivative of 


* Tf (2. Yo) is a point on the boundary of the region of definition, we make 
the restriction that in the passage to the limit the point (# + 4, y)) must always 
remain in the region. 
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S(z, y) with respect to y at the point (2%, y,) by the relation 
lim 17 Yo “+ A= I (@o; Yo) 


k->0 


= So, Yo) "= D,f (Zo, Yo)- 


This represents the slope of the curve of intersection of the surface 
u= f(x, y) with the plane 2= a, perpendicular to the z-axis. 
Let us now think of the point (a, y), hitherto considered 
fixed, as variable, and accordingly omit the suffixes 0. In other 
words, we think of the differentiation as carried out at any point 
(x, y) of the region of definition of f(z, y). Then the two deriva- 
tives are themselves functions of x and y, 
of (x, y) of(@ y) 
ox oy 
For example, the function u= 2*+ y® has the partial derivatives 
U, = 2x (in differentiation with respect to x the term y* is regarded as a 
constant and so has the derivative 0) and u, = 2y. The partial derivatives 
of u == 25y are u, = 32*y and u, = 2°. 


U,(2, y) = f,(2, y) = and Uy (2, y= = f(a, y) = 


We similarly make the following definition for any number » 
of independent variables, 


Of (a, Lo, « . + 2x) = jimf at h, Le, S28 9 Ln) — f (Ly, Ley - ve > @n) 
0a h>o h 


= fr(%, Day s+ +5 Gn) = Dz, f (@y Ty, + + + 5 Ly), 


it being assumed that the limit exists. 

Of course we can also form higher partial derivatives of f(a, y) 
by again differentiating the partial derivatives of the “ first 
order”, f,(z, y) and f,(x, y), with respect to one of the variables, 
and repeating this process. We indicate the order of differentia- 
tion by the order of the suffixes or by the order of the symbols 
dz and dy in the “ denominator ”, from right to left,* and use 
the following symbols for the second derivatives: 


a I af 
oa = dat = fro = Dan, 


= D2 
ox (2 =a = Sou = Prof, 
of 


* In Continental usage, on the other hand, — 2g z) is written = By ae" 
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0 (of = Of oe 
5 (2 = Gyan See = Much 


a fa oO 
oy a nal aw Dyf. 


We likewise denote the third partial derivatives by 


2 (GS) = parla 
2 (Bh) = gin he 


0 [ of af 
On (ee = duty = fay &c.5 
and in general the n-th derivatives by 


oD) 
dx \A 


ri) = Gan Jom 
@ fore anf 
ay (G4 = Oyu" On"—1 = fy, &e. 


In practice the performance of partial differentiations involves 
nothing that the student has not met with already. For according 
to the definition all the independent variables are to be kept 
constant except the one with respect to which we are differentiat- 
ing. We therefore have merely to regard the other variables as 
constants and carry out the differentiation according to the rules 
by which we differentiate functions of a single independent 
variable. The student may nevertheless find it helpful to study 
the examples of partial differentiation given in Chapter X of 
Vol. I (p. 469 et seq.). 

Just as in the case of one independent variable, the possession 
of derivatives is a special property of a function, not enjoyed 
even by all continuous functions.* All the same, this property 
is possessed by all functions of practical importance, except 
perhaps at isolated exceptional points. 


*For an explanation of the term “ differentiable”, which implies more 
than that the partial derivatives with respect to 7 and y exist, see p. 60 
et 8eq. 
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2. Continuity and the Existence of Partial Derivatives with 
respect to x and y. 


Tn the case of functions of a single variable, we know that 
the existence of the derivative of a function at a point implies 
the continuity of the function at that point (cf. Vol. I, p. 97). 
In contrast with this, the possession of partial derivatives does 
not imply the a of a function of two variables: e.g. the 


function u (x, y)= with u (0, 0) = 0, has partial derivatives 


ay 2° 
everywhere, and yet we have already seen (p. 46) that it is discon- 
tinuous at the origin. Geometrically speaking, the existence of 
partial derivatives restricts the behaviour of the function in the 
directions of the z- and y-axes only, and not in other directions. 
Nevertheless the possession of bounded partial derivatives does 
imply continuity, as is stated by the following theorem: 

If a function {(x, y) has partial derivatives f, and £, everywhere 
in a region R, and these derivatives everywhere satisfy the im- 
equalities 
. [fle (<M, |A@ |< 4, 
where M is independent of x and y, then f(x, y) is continuous 
everywhere in R. 

To prove this we consider two points with co-ordinates (2, y) 
and («+ h, y+ k) respectively, both lying in the region R. 
We further assume that the two line-segments joining these 
points to the point (c+ A, y) both lie entirely in R; this is 
certainly true if (7, y) is a point interior to R and the point 
(7+ h, y+ &) lies sufficiently close to (x, y). We then have 


fethy+b)—fay=fet+rhy+h)—fet+hy} 
+{f(a+ h, y) — fla, y)}. 


The two terms in the first bracket on the right differ only in y, 
those in the second bracket only in x. We can therefore trans- 
form both the brackets on the right-hand side by means of the 
ordinary mean value theorem of the differential calculus (Vol. I, 
p. 103), regarding the first bracket as a function of y alone 
and the second as a function of z alone. We thus obtain the 
relation 


fat hyt+h)—f(e, y) =H let h, y+ Ok) + hflx +h, y), 
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where 0, and 0, are two numbers between 0 and 1. In other words, 
the derivative “with respect to y is to be formed for a point of the 
vertical line joining (2+h, y) to (zx+-h, y+), and the deriva- 
tive with respect to x is to be formed for a point of the horizontal 
line joining (x, y) and (t+ h, y). Since by hypothesis both 
derivatives are less in absolute value than M, it follows that 


\fe+thy+th—f@ y)| SM el|+ |e). 


For sufficiently small values of 4 and & the right-hand side is 
itself arbitrarily small, and the continuity of f(a, y) is proved. 


3. Change of the Order of Differentiation. 


In the examples of partial differentiation given in Vol. I it 
will be found that f,, = fey; in other words, it makes no difference 
whether we differentiate first with respect to « and then with 
respect to y, or first with respect to y and then with respect 
to x. This observation depends on the following important 
theorem: 

If the “ mixed” partial derivatives fxy and f,x of a function 
f(x, y) are continuous in a region R, then the equation 

Si ve S wy 
holds throughout the interior of that region; that 1s, the order of 
differentiation with respect to x and to y is immaterial. 

The proof, like that of the previous sub-section, is based on 
the mean value theorem of the differential calculus. We consider 
the four points (x, y), (v-+ h, y), («, yt k), and (w+ h, y+ &), 
where h=+0 and k+0. If (x, y) is an interior point of the 
region R, and h and & are small enough, all four of these points 
belong to R. We now form the expression 


A=f@thy+h—fethy—fayt+)+f(z, y). 
By introducing the function 
pla) = f(x, y + k) - fe, y) 


of the variable x and regarding the variable y merely as a “ para- 
meter’, we can write this expression in the form 


A = $(@ + h)— (a). 


Transforming the right-hand side by means of the ordinary 


56 FUNCTIONS OF SEVERAL VARIABLES  [Cuap. 


mean value theorem of the differential calculus, we obtain 
A= hd'(x + Oh), 


where 6 lies between 0 and 1. From the definition of d(x), how- 
ever, we have 


$ (x) = fila, y + k) — fla, y); 


and since we have assumed that the “mixed” second partial 
derivative f,, does exist, we can again apply the mean value 
theorem and find that 


A = hf (a + Oh, y + O°), 


where @ and 6’ denote two numbers between 0 and 1. 
Tn exactly the same way we can start with the function 


Wy) = f(a + h, y) ~ f(a, y) 
and represent 4 by means of the equation 
A= dy + & — py). 
We thus arrive at the equation 
A= hkfny(x-+ Oh, y+ 0,/k), where 0<6,<1and0<6,’<1, 


and if we equate the two expressions for A we obtain the equation 
Fuele + Oh, ¥ + Wh) = fot + Oh, y + 6,’k). 


If here we let h and & tend simultaneously to 0 and recall that 
the derivatives f,,,(z, y) and f,,(z, y) are continuous at the point 
(x, y), we immediately obtain 


Sul®, y) = Sey(a, Y)s 


which was to be proved.* 


* For more refined investigations it is often useful to know that the theorem 
on the reversibility of the order of differentiation can be proved with weaker 
hypotheses. It is, in fact, sufficient to assume that, in addition to the first 
partial derivatives f, and fy, only one mixed partial derivative, say fay, exists, 
and that this derivative is continuous at the poind in question, To prove this, we 
return to the above equation 


A =fathy + k) — f(a + h, y) — fly t+ k) + f(x, y)s 


divide by hk, and then let & alone tend to 0. Then the right-hand side hag a 
limit, and therefore the left-hand side also has a limit, and 


in 2 aw Sot thy) — Solr 9) 
hove Hh Kon 
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The theorem on the reversibility of the order of differentiation — 
has far-reaching consequences. In particular, we see that the 
number of distinct derivatives of the second order and of higher 
orders of functions of several variables is decidedly smaller than 
we might at first have expected. If we assume that all the 
derivatives which we are about to form are continuous functions 
of the independent variables in the region under consideration, and 
if we apply our theorem to the functions f,,(z, y), fy(2, Y)s fav(® Y)s 
&c., instead of to the function f(x, y), we arrive at the equations 


Sewy = Seve = Sewn 


Sew = Suey = Suves 
Seevy = Seyey = Sevve = Sueev = Syeve = f vuses 
and in general we have the following result: 
In the repeated differentiation of a function of two independent 
variables the order of the differentiations may be changed at will, pro- 
vided only that the deriwatives in question are continuous functions.* 


Further, it was proved above with the sole assumption that f,, exists that 
A 
ik Fyq(t + Oh, y + 8k). 


In virtue of the assumed continuity of f,,, we find that for arbitrary « > 0 
and for all sufficiently small values of h and k 


Sul% y) — € <fy A(t + Oh, y + Hk) <fylt y) + 6 
whence it follows that 
Sugl*, y)—e sats Ses) Shiyl% y)t+e 


on lim fz +h, v) — S(t, y) = foals ») 
h->0 
that is, f ‘cyl y) = f, ‘veal %s y)- 


*It is of fundamental interest to show by means of an example 
that in the absence of the assumption of the continuity of the second 
derivative f,., oF fy, the theorem need not be true, and that on the contrary 

2 42 
Say can differ from f,,. This is exemplified by the function f(z, y) = xy aro 
f(0, 0) = 0, for which all the partial derivatives of second order exist, but are 
not continuous. We find that 


a f(z, y) — £0, y) _ . ata yt 
fo(0, y) = tim OSE EE = lim yaa 
= tm £9) — fle, 9) _ ,, at — yt 
fas) — tn SE SLED tm, SOs 
and consequently 
fyx(0, 0) = —1 and f,,(0,0) = +1, 


These two expressions are different, which by the above theorem can only be 
due to the discontinuity of f,,, at the origin. 
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With our assumptions about continuity a function of two 
variables has three partial derivatives of the second order, 


Soo few Sun 
four partial derivatives of the third order, 
foe fav form Suv 
and in general (n + 1) partial derivatives of the n-th order, 


Son; Sumy, Saray, seve » Saw, Sm. 

It is obvious that similar statements also hold for functions 
of more than two independent variables. For we can apply our 
proof equally well to the interchange of differentiations with 
respect to x and z or with respect to y and z, &c., for each in- 
terchange of two successive differentiations involves only two 
independent variables at a time. 


EXaMyLes 


1. How many n-th derivatives has a function of three variables? 


2. Prove that the function 
L 


S (p> Hor vay Xn) => @@+et...- ers eee _2\"— 
satisfies the equation 


Foo a + forge, +e. + fary ty = 0. 


3. Calculate 
ge |@ +a b c 
= d e+a f 
Ox" 
9 h k+e2 


4. Prove that 
fy (%) fe(x) fax) 
9'(y) ge'(y) 9s'(y) 


Siz) fol) f3(2) 
my) gly) gly) 


Ox dy 2. 
moyen A(z) Inez) hs(e)| | Fy/(2) Ia/(2)y’(2) 
5. Considering 
abe 
D=|d e f 
g hk 
as a function of the nine variables a, b,. . . , k, prove that 


(a) aD, +6D,+cD,= D, 
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D, D, D, 
Dg D, Dy 
D, Dy De 


(b)* 
= D 


4. Taz Tora, DirreERENTIAL oF A FUNCTION AND ITS 
GeometricaL MEANniIne 


1. The Concept of Differentiability. 


In the case of functions of one variable the existence of a 
derivative is intimately connected with the possibility of approxi- 
mating to the function »=/f(é) in the neighbourhood of the 
point « by means of a linear function » = ¢(€). This linear func- 
tion is defined by the equation 


P(E) = f(z) + (E — 2) f"(@). 


Geometrically (¢ and 7 being current co-ordinates), this represents 
the tangent to the curve » = f(£) at the point P with the co- 
ordinates =a and 7» =/(x); analytically, its characteristic 
feature is that it differs from the function /(€) in the neighbour- 
hood of P by a quantity o(h) of higher order than the abscissa 
h= &— 2 (cf. p. 48). Hence 


f() — H€) =f(8) —f@) — (€ — a) f'(e) = off) 
or, otherwise, 
f(u+ h) — f(x) — hf'(#) = o(h) = eh, 


where ¢« denotes a quantity which tends to zero as h does. The 
term hf’(x), the “linear part” of the increment of f(x) corre- 
sponding to an increment of h in the independent variable, we 
have already (Vol. I, p. 107) called the differential of the function 
f(z) and have denoted it by 


dy = df (x) = hy! = hf'(a) 


(or also by dy = y’dx, since for the function y = « the differential 
has the value dy=dx= 1X h). We can now say that this 
differential is a function of the two independent variables 
g and hk, and we need not restrict the variable h in any way. 
Of course this concept of differential is as a rule only used when 
h is small, so that the differential Af’(x) forms an approximation 
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to the difference f(a + h) — f(z) which is accurate enough for 
the particular purpose. 

Conversely, instead of beginning with the notion of the deri- 
vative, we could have laid the emphasis on the requirement that 
it should be possible to approximate to the function n=f(é) in 
the neighbourhood of the point P by a linear function such that 
the difference between the function and the linear approximation 
function vanishes to a higher order than the increment h of the 
independent variable. In other words, we should require that 
for the function f(¢) at the point = there should exist a 
quantity A, depending on # but not on h, such that 


f(a + h) — f(a) = Ah-+ ot) = Ah+ eh, 


where ¢ tends to 0 with h. This condition is equivalent to 
the requirement that f(x) shall be differentiable at the point 2; 
the quantity A must then be taken as the derivative f’(x) at the 
point x. We see this immediately if we rewrite our condition in 


the form 
fe+M~fe)_ ai, 
h 


and then let h tend to 0, Differentiability of a function with 
respect to a variable and the possibility of approximating to a 
function by means of a linear function in this way are therefore 
equivalent properties. 

If we notice that 4 -+ «= a(x, h) is a function of h which 
tends to A(x) as h->0, we arrive at the equivalent definition: 
J (#) is said to be differentiable at the point x if S(@+h)—f(e) 
= ha(a, h), where the quantity a(z, h) is continuous, as a function 
of h, at h = 0. 

These ideas can be extended in a perfectly natural way to 
functions of two and more variables. 

We say that the function u= f(x, y) is differentiable at the 
point (x, y) if it can be approximated to in the neighbourhood of 
this point by a linear function, that is, if it can be represented 
in the form 


S(a+ hy + kh) =f(@,y)+ Ah+ Bk+ eh+ egh, 


where A and B are independent of the variables h and k and 
where «, and «, tend to 0 as A and k do. In other words, the 
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difference between the function f(a-+h, y+ %) at the point 
(e+ h, y+) and the function f(z, y)+ 4h-+ Bk which is 
linear in A and k must be of the order of magnitude* o(p), that is, 
must vanish as p->0 to a higher order than the distance 
p= /(l? + I) of the point (x + h, y + &) from the point (z, y). 

If such an approximate representation is possible, it follows 
at once that the function f(#, y) can be partially differentiated 
with respect to # and to y at the point (x, y) and that 


So = A and f, = B. 
For if we put & = 0 and divide by A we obtain the relation 
f(e+h, a) — Se Ay a: 


Since ¢, tends to zero with h, as we pass to the limit 2 > 0 the 
left-hand side has a limit, and that limit is A. Similarly, we 
obtain the equation f,(x, y) = B. 

Conversely, we shall prove that a function u=— f(z, y) is 
differentiable in the sense just defined, that is, it can be approxi- 
mated to by a linear function, if it possesses continuous deriva- 
tives of the first order at the point in question. In fact, we can 
write the increment 


Au=f@+thy+kh)—f@y) 
of the function in the form 
Au={f@+thyt+h—feyt D}+{fe yt) —f@ y}. 
As before (p. 54), the two brackets can be expressed in the form 
Au = hfale + Oh, y + h) + lfyla, y + Oh), 


using the ordinary mean value theorem of the differential 
calculus. Since by hypothesis the partial derivatives f, and f, 
are continuous at the point (x, y), we can write 


fale + Oh, y + k) = file, y) + 4 


* The equivalence of these two definitions follows from the following remark: 
the inequality |e + ek| S|e«|v(h? + k*) always holds, where « = | «,| 
+ { «,{ and tends to 0 as e, and ¢, do. Hence the second definition of differenti- 
ability follows from the first. Again, since |«eV(h? + k*)| S| | (| &| + k), 
if the second condition is fulfilled the difference between the function and the 
linear approximation is of the form 6e(|%| + ||), where -1< 0 +1, 
whence it follows that the requirements of the first definition are also fulfilled. 
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and S, WX, y+ 6k) =f, y(Z, Y) + € 


where the numbers ¢, and ¢«, tend to zero as h and kdo. We thus 
obtain 
Au = hf,{x, y) +- Kf (x, y) + gh+ €ok 
= hf,(x, y) + hf(z, y) + o(4/h? + #2), 


and this equation is the expression of the above statement.* 
We shall occasionally refer to a function with continuous first 
partial derivatives as a continuously differentiable function. If 
in addition all the second-order partial derivatives are continuous, 
we say that the function is twice continuously differentiable, 
and so on. 

As in the case of functions of one variable, the definition of 
differentiability can be replaced by the following equivalent 
definition: the function f(x, y) is said to be differentiable at the 
point (a, y) if 


Seth, y+ hk) —f(e, Y) = ah-+ Bk, 


where a and § depend on & and & as well as on x and y, and are 
continuous as functions of h and k for h = 0, k= 0. 

No further discussion is required to show how these considera- 
tions can be extended to functions of three and more variables. 


2. Differentiation in a Given Direction. 


An important property of differentiable functions is that they 
not only possess partial derivatives with respect to x and y, or, 
as we also say, in the z- and y-directions, but they also have 
partial derivatives in any other direction. By the derivative in 
the direction a. we mean the following: 

We let the point (x-+ h, y+ k) approach the point (2, y) in 
such a way that it is always on the straight line through (2, y) 
which makes the constant angle a with the positive z-axis. In 
other words, h and & do not tend to 0 independently of one 
another, but satisfy the relations 


h=pcosa and k= psina, 
where p is the distance »/(h? + k?) of the point (x-+ h, y+ k) 


* If we assume the existence only, and not the continuity, of the derivatives 
f, and f,, the function is not necessarily differentiable (of. p. 65 et seq.). 
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from the point (x, y) and tends to 0 as h and k do. If as usual 
we then form the difference f(x + h, y+ k) — f(z, y) and divide 
by p, we call the limit of the fraction 


Dey f(t, y) = lim fat p cosa, y + psina) ~ f(z ¥) 
p—>o0 p 


the derivative of the function f(x, y) at the point (a, y) in the 
direction a, provided that the limit exists. In particular, when 
= 0 we have /=0 and h= pp, and we obtain the partial 
derivative with respect to 2; when a= 7/2 we have h= 0 and 
k= p, and we obtain the partial derivative with respect to y. 
If the function f(a, y) is differentiable, we have 


flethyth)—fl, y) = Met iy + ep 
= p(f, cosa-+ f, sina + e). 


As p tends to 0, so does ¢, and for the derivative in the direction 
a we obtain the expression 


Deaf (2, y) = f, cosa + f, sina; 


it is therefore a linear function of the derivatives f, and f, in the 
x- and y-directions, with the coefficients cosa and sina. This result 
always holds good, provided that the derivatives f, and f, exist 
and are continuous at the point in question. 


Thus for the radius vector r= V(x? + y*) from the origin to the point 
(x, y) we have the partial derivatives 


t, = 7 = -=cosd and * me eo Yo int 
e Veit yor vost por” 
where @ denotes the angle which the radius vector makes with the x-axis, 

Consequently, in the direction « the function r has the derivative 

Det = Tz 008% + 7, sina = cos8 cosa + sin6 sine = cos(® — a); 
in particular, in the direction of the radius vector itself this derivative has 
the value 1, while in the direction perpendicular to the radius vector it 
has the value 0. ; 

In the direction of the radius vector the function z has the derivative 
D(x) = cos and the function y has the derivative Di»)(y) = sin9; in the 
direction perpendicular to the radius vector they have the derivatives 
Do+xjnt = —s8in® and Dio+5)2)¥ = cos@ respectively. 


The derivative of a function f(z, y) in the direction of the 
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radius vector is in general denoted by f(a, y) Thus we have 
the convenient relation or 

7) C) :) 

a cos 8 — + wane 

where any differentiable function can be written after the symbols 
00a 

Or’ Oa Oy’ 

It is also worth noting that we obtain the derivative of the 
function f(z, y) in the direction a if, instead of allowing the 
point Q with co-ordinates (x + h, y+ k) to approach the point 
P with co-ordinates (x, y) along a straight line with the direction 
a, we let Q approach P along an arbitrary curve whose tangent 
at P has the direction a. For then if the line PQ has the direction 
B, we can write h= pcos f, k= p sin, and in the formule used 
in the above proof we have to replace a by B. But since by 
hypothesis 8 tends to a as p -> 0, we obtain the same expression 
for Diy f(x, y). 

In the same way, a differentiable function f(z, y, z) of three 
independent variables can be differentiated in a given direction. 
We suppose that the direction is specified by the cosines of the 
three angles which it forms with the co-ordinate axes. If we 
call these three angles a, B, y, and if we consider two points 
(x, y, 2) and (w+ h, y+k, z+), where 


h= pcosa, 
k= pcos, 
l= p cosy, 


then just as above we obtain the expression 
f, cosa + f, cos B + f, cosy 
for the derivative in the direction given by the angles (a, f, y). 


3. Geometrical Interpretation. The Tangent Plane. 


For a function u= (f(z, y) all these matters can easily be 
illustrated geometrically. We recall that the partial derivative 
with respect to x is the slope of the tangent to the curve in which 
the surface is intersected by a plane perpendicular to the zy-plane 
and parallel to the xu-plane. In the same way, the derivative in 
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the direction a gives the slope of the tangent to the curve in which 
the surface is intersected by a plane perpendicular to the zy-plane 
and making the angle a with the z-axis. The formula D,,, f(a, y) 
= f, cosa + f, sina now enables us to calculate the slopes of 
the tangents to all such curves, that is, of all tangents to 
the surface at a given point, from the slopes of two such 
tangents. 

We approximated to the differentiable function £ = f(é, 7) 
in the neighbourhood of the point (a, y) by the linear function 


f(E ny =f @, y) + (E—*Dfe+ — Wf» © 


where € and 7 are the current co-ordinates. Geometrically this 
linear function represents a plane, which by analogy with the 
tangent line to a curve we shall call the tangent plane to the sur- 
face. The difference between this linear function and the function 
F(E, 4) tends to zero as €—x2=h and n»—y=k do, and in 
fact bua ire to a higher order than »/(h? + k?). By the defi- 
nition of the tangent to a plane curve, however, this states 
that the intersection of the tangent plane with any plane per- 
pendicular to the zy-plane is the tangent to the corresponding 
curve of intersection. We thus see that all these tangent lines to 
the surface at the point (x, y, u) lie in one plane, the tangent 
plane. 

This property is the geometrical expression of the differen- 
tiability of the function at the point (2, y, u=/f(x, y)). If 
(, 4, ¢) are current co-ordinates, the equation of the tangent 
plane at the point (x, y, u= f(z, y)) is 


f-u= (€—2)fe+ (n— yf 


As has already been shown on p. 61, the function is differen- 
tiable at a given point provided that the partial derivatives are 
continuous there. In contrast with the case where there is only 
one independent variable, the mere evistence of the partial de- 
rivatives f, and f, is not sufficient to ensure the differentiability 
of the function. If the derivatives are not continuous at the 
point in question, the tangent plane to the surface at this point 
may fail to exist, or, analytically speaking, the difference between 
S(o+ h, y+ k) and the function f(x, y) + Af.(x, y) + Ifa, y) 
which is linear in & and & may fail to vanish to a higher order 
than 4/(h? + k). 

4 


(B912) 
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This is clearly shown by a simple example. We write 


f(z y)=0 if c=0 or y=9, 
f(y y=|e] if w—y=0 or e+y=0. 


Between these lines we define the function in such a way that it is repre- 
sented geometrically by planes. The surface u = f(x, y) therefore consists 
of eight triangular pieces of planes, meeting in roof-like edges above the 
lines 2 = 0, y= 0, y= 2% and y= —z2. This surface obviously has no 
tangent plane at the origin, although the derivatives f,(0, 0) and f,(0, 0) 
both exist and have the value 0. The derivatives are not continuous at 
the origin, however; in fact, as we readily see, they do not even exist on 
the edges.* 


4. The Total Differential of a Function. 


As in the case ot functions of one variable, it is often con- 
venient to have a special name and symbol for the linear part 
of the increment of a differentiable function u= f(x, y). We 
call this linear part the differential of the function, and write 


= Og Oh of 
du = df (a, age cea eo yt 


The differential, sometimes called the total differential, is a 
function of four independent variables, namely the co-ordinates 
z and y of the point under consideration and the increments h 


* Another example of a similar type is givea by the function 


e=flen- yas 
u=0 if c=0,y=0. 


if 2? + y? + 0, 


If we introduce polar co-ordinates this becomes 
u = 5sin20. 


The first derivatives with respect to x and to y exist everywhere in the neighbour- 
hood of the origin and have the value 0 at the origin itself. These derivatives, 
however, are not continuous at the origin, for 


(wes ee Nit 
“1 (laa Vem) "eae 


If we approach the origin along the z-axis, wu, tends to 0, while if we approach 
along the y-axis, u, tends to 1. This function is not differentiable at the origin; 
at that point no tangent plane to the surface u = f(z, y) exists. For the 
equations f,(0, 0) = f,(0, 0) = 0 show that the tangent plane would have to 
coincide with the plane u = 0. But at the points of the line @ = 1/4 we have 
sin2@ = 1 and uw = r/2; thus the distance u of the point of the surface from 
the point of the plane does not, as must be the case with a tangent plane, 
vanish to a higher order than r. 
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and k, which are the differentials of the independent variables 
or independent differentials. We need scarcely emphasize once 
more that this has nothing to do with the vague concept of 
“infinitely small quantities”. It simply means that du approxi- 
mates to Au = f(z + h, y+ k) — f(a, y), the increment of the 
function, with an error which is an arbitrarily small frac- 
tion of +/(h? -+ 2?) (itself arbitrarily small), provided that h 
and k are sufficiently small quantities. Incidentally, we thus 
collect the expressions for the different partial derivatives in one 
formula. For example, from the total differential we obtain the 


partial derivative Z 7 DY putting dy = 0 and dx= 1. 

We again fee that to speak of the total differential of 
a function f(x, y) has no meaning unless the function is differen- 
tiable in the sense defined above (for which the continuity, but 
not the mere existence, of the two partial derivatives suffices). 

If the function f(x, y) also possesses continuous partial de- 
tivatives of higher order, we can form the differential of the 
differential df(x, y), that is, we can multiply its partial deriva- 
tives with respect to x and y by h= dx and k= dy respectively 
and then add these products. In this differentiation we must 
regard h and & as constants, corresponding to the fact that the 
differential df = hf, + kf, is a function of the four independent 
variables x, y, h, and k. We thus obtain the second differential * 
of the function, 


@f = d(df) = a (Za+Z bh 5 (eat Ei k 


Similarly, we can form the pele differentials 


a a ef 4 
, da dy? 
+ aya dy, 
* We shall later see (p. 80 ef seg.) that the differentials of higher order intro- 


duced formally here correspond exactly to the terms of the corresponding order 
in the increment of the function. 


Bf = def) = a Lae +39 
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ap PF db +4 OF antdy +6 °F datdy? 


da? Oy Ox? Oy? 

otf of 
4_~. dad = dy', 
+ 457, dandy’ + x=, dy" 


and, as we can easily show by induction, in general 


arp = FE ar + (" Vata a, dat-ldy +... 


onf 
dady"— + -“ dy". 
+(." 1) aga y + 5a 
The last expression can be expressed symbolically by the equation 
(n) 
apa (Law + Fay)” = (fade + fay) 
Cy oy 


where the expression on the right is first to be expanded formally 
by the binomial theorem, and then the expressions 


OF ae OF ga 5 ay 

Ban > Fae tdy OT «<9 Bn 
are to be substituted for the products and powers of the quan- 
tities f,,dx and f, dy. 


For calculations with differentials the rule 
a( fg) = fdg + gdf 


holds good; this follows immediately from the rule for the 
differentiation of a product. 

In conclusion, we remark that the discussion in this sub- 
section can immediately be extended to functions of more than 
two independent variables. 


5. Application to the Calculus of Errors. 


The practical advantage of having the differential df = hf, + kf, as a 
convenient approximation to the increment of the function f(x, y), Au = 
f(a+h, y+ k)—f(x, y), a8 we pass from (2, y) to (x-+ h, y+ k), is exhibited 
particularly well in the so-called “ calculus of errors” (cf. Vol. I, p. 349). 
Suppose, for example, that we wish to find the possible error in the deter- 
mination of the density of a solid body by the method of displacement. 
If m is the weight of the body in air and m its weight in water, by Archi- 
medes’ principle the loss of weight (m— m) is the weight of the water 
displaced. If we are using the c.g.s. system of units, the weight of the 


IT] THE TOTAL DIFFERENTIAL 69 


water displaced is numerically equal to its volume, and hence to the 
volume of the solid. The density s is thus given in terms of the inde- 
pendent variables m and m by the formula s = m/(m— m). The error 
in the measurement of the density s caused by an error dm in the measure- 
ment of m and an error dm in the measurement of m is given apprommnavaly 
by the total differential 


ds = * am + 2 dm. 
am om 


By the quotient rule the partial derivatives are 


os m 08 m 


in ow — mp oom Gn — 
hence the differential is 
ds = —mdm + mdm 
(m — m)* 


Thus the error in ¢ is greatest if, say, dm is negative and dm is positive; 
that is, if instead of m we measure too small an amount m-+ dm and 
instead of m too large an amount m + dm. For example, if a piece of brass 
weighs about 100 gm. in air, with a possible error of 5 mg., and in water 
weighs about 88 gm., with a possible error of 8 mg., the density is given 
by our formula to within an error of about 


88.5.10-° + 100.8. 10-% 


ED 9. 10-8, 


or about one per cent. 


5. Functions or Functions (Compounp Functions) AND THE 
Intropuction or New INDEPENDENT VARIABLES 


1. General Remarks. The Chain Rule. 


It often happens that the function u of the independent 
variables 2, y is stated in the form of a compound function 


u=f(E 7, +--+) 


where the arguments £, 7, ... of the function f are themselves 
functions of # and y: 


= d(z, y), n= H(z, y),.-.. 
We then say that 
u=f(é, Noes )=f(d(a, y); 2(x, y); oe = F(z, y) 


is given as a compound function of « and y. 
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For example, the function 

u= ev sin(x + y) = F(a, y) 
may be written as a compound function by means of the relations 
u=efsinn=f(% 9); =z, n=aty. 
Similarly, the function 
u = log (24 + y4). arc sin\/1 — 2? — y? = F(z, y) 

can be expressed in the form 


u= naresine = f(&, n)s 


E=V1I—@—¥, y= log(at + 4). 


In order to make this concept more precise, we adopt the 
following assumption to begin with: the functions = ¢(z, y), 
n= (a, y),... are defined in a certain region R of the inde- 
pendent variables z, y. As the argument point (x, y) varies 
within this region, the point with the co-ordinates (€, 7, .. -) 
always lies in a certain region S of &y. . .-space, in which the 
function u=f(é, 7,...) is defined. The compound function 


u=f(P(z, y), Ya, y), .- .) = F(a, y) 
is then defined in the region R. 


In many cases detailed examination of the regions R and § will be 
quite unnecessary, e.g. in the first example given above, in which the 
argument point (z, y) can traverse the whole of the ay-plane and the 
function u = ef siny is defined throughout the Ey-plane. On the other 
hand, the second example shows the need for considering the regions K 
and 8 in the definition of compound functions. For the functions 


E=V1—a2—y and n= log(at+ y') 


are defined only in the region R consisting of the points 0 < 27+ y¥7 $1, 
that is, the region consisting of the circle with unit radius and centre the 
origin, the centre being removed. Within this region | § |< 1, while y 
can have all negative values and the value 0. For the region § of points 
(&, n) defined by these relations the function 7 arc sin & is defined. 


A continuous function of continuous functions is itself con- 
tinuous. More precisely: 

If the function u = £(€, 7, .. .) is continuous in the region 8, 
and the functions & = (x, y), n= ¥(x, y), ... are continuous 
in the region R, then the compound function u = F(x, y) 1s con- 
tinuous in R. 
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The proof follows immediately from the definition of con- 
tinuity. Let (xo, yo) be a point of R, and let &, no,... be the 
corresponding values of £, 7, .... Then for any positive ¢ the 


difference 
Fé, y oe :) — fo, No oe ) 
is numerically less than ¢, provided only that the inequalities 
l€— €.1<8 |2—-| <5... 


are all satisfied, where 5 is a sufficiently small positive number. 
But by the continuity of d(x, y), ¢(«, y), . . . these last inequalities 
are all satisfied if 

|e—al<y ly—yl<y 
where y is a sufficiently small positive quantity. This establishes 
the continuity of the compound function. 

Further, we shall prove that a differentiable function of 
differentiable functions is itself differentiable. This statement is 
formulated more precisely in the following theorem, which at the 
same time gives the rule for the differentiation of compound 
functions, or so-called chain rule: 

If &= $(x, y), 7 = Wx, y), ... are differentiable functions 
of x and y in the region R, and f(é, y,...) is a differentiable 
function of €, 4, . . . in the region 8, then the compound function 


u=f (d(x, y), oa, y),.-.) = Fla, y) 


is also a differentiable function of x and y, and its partial deriva- 
tives are given by the formule 


Fo=fibet fet sees 
Py = feb + fiat ooes 


Ue = Upbeat UNet.-+s 
Uy = Uy + Uny + .. ve 


Thus in order to form the partial derivative with respect to x 
we must first differentiate the compound function with respect 
to all the functions ¢, 7, ... which depend on x, multiply each 
of these derivatives by the derivative of the corresponding 
function with respect to 2, and then add all the products thus 
formed. This is the generalization of the chain rule for 


or, briefly, by 
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functions: of one variable discussed in Vol. I, Chapter III 
(p. 153). 

Our statement can be written in a particularly simple and. 
suggestive form if we use the notation of differentials, 
namely 


du = u,de + uydy = u,dé-+ u,dy+... 
= u;(€,dx + £, dy) + u,(jede + ny dy) +. : 
= (Ugke + Uyte ++ -)da+ (UgEy + Uy + - « -)dy. 


This equation means that the linear part of the increment of the 
compound function u== f(g, 7, - . -) = F(a, y) can be found by 
first writing down this linear part as if €, 7, ... were the inde- 
pendent variables and subsequently replacing dé, dy, ... by the 
linear parts of the increments of the functions ¢= ¢(2, y), 
7 = (2, y),.... This fact exhibits the convenience and flexibility 
of the differential notation. 

In order to prove our statement we have merely to make use 
of the assumption that the functions concerned are differentiable. 
From this it follows that if we denote the increments of the 
independent variables x and y by Az and Ay, the quantities a 
change by the amounts 


AE = debe + pyAy + Ax + yAy, 
An = p, Ax + by Ay + e,Az + ye Ay, 


where the numbers €,, €2)---» Yp Ye --- tend to 0 as Az and Ay 
do, or as «/(Az? + Ay?) does. Moreover, if the quantities , , . < . 
undergo changes Ag, Ay, ..., the function u= f(€, 9, ...) is 
subject to an increment of the form 


Au = f,Aé + f,An+...+ 8A€+ 8Ant+..., 


where the quantities 8,, 5,,... tend to 0 as Ag, An, ... do, or 
as «/(Ag? + Ay2-+ ...) does (and may be taken as exactly zero 
when the corresponding increments A£é, Ay vanish). 

If in the last expression we take the mcrements Aé, An, .. 
as those due to a change of Az in the value of x and a change of 
Ay in the value of y, as given above, we obtain 


Au = (fits + frie +. Av 
+ (fibu + Sib +. Jay + eAt + yAy. 
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Here the quantities « and y have the values 


e= firey + fies tenet bed, + edgy + €, 5; + 5, +... 
vy=fanthiyet . ot by83 + py Ss + 748; + ye8n+.- 


On the right we have a sum of products, each of which contains 
at least one of the quantities €,, €,. ~~ 5 Yt» Yor - ++ Oy) Oy «+s 
From this we see that « and y also tend to 0 as Az and Ay do. 
By the results of the preceding section, however, this expresses 
the statement asserted in our theorem. 

It is obvious that this result is quite independent of the 
number of independent variables x, y,..., and remains valid e.g. 
if the quantities £, 4, ... depend on only one independent 
variable x, so that the quantity u is a compound function of the 
single independent variable 2. 

If we wish to calculate the higher partial derivatives, we have 
only to differentiate the right-hand sides of our equations with 
respect to x and y, treating /,, f,, ... a8 compound functions. 
Confining ourselves for the sake of simplicity to the case of three 
functions €, 7, and ¢, we thus obtain 


Una = feeb + Sle + feba® + 2fenEaNe + 2f, Nabe + fecEale 
+ Fic&cx + fiNex + flee 


Uey=h, ee wuts, mely eh eet aby t+feJl€ Mut Ena) +f, “Nabe +ybe) 
+ fel Enby + Efe) + febey + fiNey + fe Sevs 


Uyy=feeEv? +i + fez ot Fen Evy + ee + Fecbyoy 
+ frbuy + fin + foun 


2. Examples.* 


1. Let us consider the function 


= et? inty + Sey sing siny + y* 


We put 
E=asinty, n= 2ysinzsiny, €= y* 
and obtain 
&, = 2esin?y, yn, = 2ysinxzsiny + 2xycoszsiny, ©, = 0; 
E, = 2x siny cosy, yn, = 2esinxsiny + 2zysinz cosy, C, = 2y; 
Uy =U, = U= eftats, 


* We would emphasize that the following differentiations can also be carried 


out directly, without using the chain rule. 
«ae (avl2) 
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Hence : 
Uy = Qe** sary + Bey sing siny + ¥* (y sin?y + y sina siny + zy cosz siny) 
and 
Uy = ew sinty + Bey sins ciny + ¥* (92 siny cosy ++ x sing siny 
+ xy sinz cosy + y). 
2. In the case of the function 
u = sin(z? + y*) 
we put & = 2? + y?, and obtain 
ty = 20 cos(z2 + y*), wy = 2y cos(z* + y*), 
Vag = —43 sin (2? + y*) + 2 cos (2? + y*), Uny = —Aay sin (2? + y*), 
Uyy = —4y? sin (a? + y*) + 2 cos(a* + y?). 
3. In the case of the function 
u = are tan(z? + zy + y?), 
E=2, n=ry C= ¥ 
bias ee 
P+ + ay te 
jai a 
"Lt @ + yt 


the substitution 


leads to 


u 


3. Change of the Independent Variables. 


A particularly important application of the facts developed 
on pp. 69-74 occurs in the process of changing the independent 
variables. For example, let u = /(&, 7) be a function of the two 
independent variables £, 7, which we interpret as rectangular 
co-ordinates in the &y-plane. If we introduce new rectangular 
co-ordinates x, y in that plane (cf. p. 6) by the transformation 


f=az+ py, c= a, + agy, 
n=age+ By, y= Bié+ Bon, 


the function u = f(é, 7) is transformed into a new function of z 


and y, 


and this new function is formed from f(£, 4) by a process of com- 
pounding such as was described on p. 69. We then say that 
new independent variables x and y have been introduced into the 
relation u = f(&, 1) between the independent variables € and 7 
and the dependent variable wu. 
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The rules of differentiation given on p. 71 at once yield 
Uy = Uzay + Uy, Ag, 
Uy = UgBy + U,, Bay 


where the symbols u,, w, denote the partial derivatives of the 
function F(z, y), and the symbols u,, u, denote the partial 
derivatives of the function f(&, 7). 

Thus the partial derivatives of any function are transformed 
according to the same law as the independent variables when 
the co-ordinate axes are rotated. This is true for rotation of the 
axes in space also. 

Another important type of change of the independent variables 
is the change from rectangular co-ordinates (x, y) to polar 
co-ordinates (r, 8) which are connected with the rectangular 
co-ordinates by the equations 


2=rcos), r= /(2*+ 9°), 


y=rsind, ?= arc cos arc sin 


od y 
On introducing the polar co-ordinates we have 
u= f(a, y) = f(r cos 6, rsin#) = F(r, 6), 
and the quantity w appears as a compound function of the inde- 
pendent variables r and 6. Hence by the chain rule we obtain 


sin @ 


x y 
Uy = Ul g + Ug0e = Ur Pa Uy 


cos @ 
Uy = Uy + Uy = Uy 2 Ub ty G = Uy Sin 8 ay. 


These yield the equation 


1 
Ug? Uy? = U2 + 7? Ug’, 


which is frequently of use. By the chain rule the higher 
derivatives are given by 


sin? cos @ sin 8 sin? @ 
Uge == Ur COS Os te a 2g - ; + Up — 


cos 9 sin 8 


+ 2, nae 


76 FUNCTIONS OF SEVERAL VARIABLES  [Cuap. 


jos 6sin@ cos? 6 — sin?@ 
Ungy = Uys == Upp C089 8iNO — Ugg ————— + Ug ——_———- 
r 
sin?@ — ier sin 6 cos 8 
+ Uy ——.——_ — % ——, 
7 r 
: cos? @ cos @ sin 8 cos? @ 
Uyy = Upy 8in?O + Ugg + 2tlyg ———— + Uy 
ia r r 
cos 6 sin 8 
omer Quy SS eS 
P 


This leads us to the following formula, giving the expression 
appearing in the well-known “ Laplace’s”’ or “ potential ” equa- 
tion Au = 0 in terms of ro co-ordinates: 

0f.@ Ou 
A= thee + tyy = Ure + Yon + uy = alee (S) + Sl. 
Of the formule 


x ; 
Up = Ug —+ Uy Y¥ Uz cos + u, sind, 

r r 
Ug = —Ugy + Uy x = —u,7 sind + uyr cos 8, 


which express the rules for the differentiation of a function f(x, y) 
with respect to 7 and 06, the first is the expression for the 
derivative of f(z, y) in the direction of the radius vector r which 
we previously met with on p. 64. 

In general, whenever we are given a series of relations defining 
a compound function, 


u=f(&--.), 
é = d(z, Y)>» = d(x, Y), yee 


we may regard it as an introduction of new independent variables 
x, y instead of €, 7,.... Corresponding sets of values of the 
independent variables assign the same value to u, whether it is 
regarded as a function of £,7,... or of a, y. 

In all cases involving the differentiation of compound functions 


u=f(§, Uren 


the following point must carefully be noted. We must distin- 
guish clearly between the dependent variable uw and the func- 
tion f(g, 7, ...) which connects u with the independent variables 
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£,7,... . The symbols of differentiation u,, u,,... have no mean- 
ing until the functional connexion between wu and the indepen- 
dent variables is specified. When dealing with compound functions 
u=f(é, 7, ...)= F(z, y), therefore, we really should not write 
Uz, U, OF Uz, Uy, but should instead write f,, f, or F,, F, respec- 
tively. Yet for the sake of brevity the simpler symbols u;, u,, 
U_, Uy are often used when there is no risk that confusion will arise. 


The following example will serve to show that the result of differentiating 
a quantity depends on the nature of the functional connexion between 
it and the independent variables, that is, it depends on which of the 
independent variables are kept fixed during the differentiation. With the 
“identical” transformation §= 4%, y= y the function u= 28+ 4 
becomes uw == 2x-+ y, and we have u, = 2, u,= 1. If, however, we 
introduce the new independent variables & = a (as before) and &+ y= », 
we find that u= 2+ v, so that u,= 1, u, = 1.. That is, differentiation 
with respect to the same independent variable x gives different results in 
the two different cases. 


EXAMPLES 
1. Prove that the tangent plane to the quadric 
aa + by® + c28 = 1 
at the point (a9, ¥, Zo) is 
axty + byyy + czz = 1. 
2. If u = u(x, y) is the equation of a cone, then 
Uggtyy ~ Ugy* = 
3. Prove that if a function f(z) is continuous and has a continuous 
derivative, then the derivative of the function 


f(z) «2 1 
g(x) = jflm) am 1 
f(%_) % 1 
vanishes for a certain value between z, and 2,. 
4. Let f(x, y, z) be @ function depending only on r = V(x? + y* + 2%), 
ie. let f(x, y, 2) = g(r). 
(a) Calculate fre + fuy + fee 
(6) Prove that if foe + fyy + Sez = 0, it follows that ae = ow b (where 
a and 6 are constants). 
5. If f(a, tq, ..+, ty) = g(r) = g(V (2,2 + 22+ ...+ 2,%)), calculate 


Fcgee, + Serge, tenet Ftgaty 
(of. Ex. 2, p. 58). 
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6*. Find the expression for f,. + fyy + f,, in three-dimensional polar 
co-ordinates, i.e. transform to the variables 7, 0, p defined by 
z= rsin8 cose 
y =rsin6 sing 
z= rcos0. 
Compare with example 4(a). 
7. Prove that the expression 
f, bord + f uy 
is unchanged by rotation of the co-ordinate system. 
8. Prove that with the linear transformation 


a= ab + By 
y= yb + 8x, 


Soal® )> feyl(% Y)> fyy(%, y) are respectively transformed by the same law 
as the coefficients a, b, c of the polynomial 


ax? + 2bay + cy’. 


6. Toe Mean Vatur THEOREM AND TAyLor’s THEOREM FOR 
Funcrions oF SEVERAL VARIABLES 


1. Statement of the Problem. Preliminary Remarks. 


We have already seen in Vol. I (Chapter VI, p. 320 e seq.) 
how a function of a single variable can be approximated to in the 
neighbourhood of a given point with an accuracy of order higher 
than the n-th, by means of a polynomial of degree n, the Taylor 
series, provided that the function possesses derivatives up to the 
(n + 1)-th order. The approximation by means of the linear 
part of the function, as given by the differential, is only the first 
step towards this closer approximation. In the case of functions 
of several variables, e.g. of two independent variables, we may 
also seek for an approximate representation in the neighbourhood 
of a given point by means of a polynomial of degree n. In other 
words, we wish to approximate to f(x-+ h, y+ k) by means of 
a “Taylor expansion ” in terms of the differences h and k. 

By a very simple device this problem can be reduced to what 
we already know from the theory of functions of one variable. 
Instead of considering the function f(z + h, y + k), we introduce 
yet another variable ¢ and regard the expression 


F(t) =f(e+ M, y+ M) 


I] TAYLOR’S THEOREM 79 


as a function of t, keeping x, y, h, and k fixed for the moment. 
As ¢ varies between 0 and 1, the point with co-ordinates 
(«+ ht, y+ kt) traverses the line-segment joining (x, y) and 
(z+ h, y+ k). 

We begin by calculating the derivatives of F(t). If we assume 
that all the derivatives of the function f(z, y) which we are about 
to write down are continuous in a region entirely containing 
the line-segment, the chain rule (section 5, p. 71) at once gives 

Ft) = hf, + ify, 
F(t) = Vfae + 2hkfay + Ih yys 


and, in general, we find by mathematical induction that the n-th 
derivative is given by the expression 


POE) = Wf + ©) hUe fag ay + (7) R2h2fg oat oo tke, 


which, as on p. 68, can be written symbolically in the form 
PO (t) = (hfe + kf). 


In this last formula the bracket on the right is to be expanded by 
the binomial theorem and then the powers and products of the 


quantities af and ¢ are to be replaced by the corresponding n-th 
'Y i 


On 
derivatives os iy »-+.+ In all these derivatives the argu- 


ments « + ht and y + kt are to be written in place of x and y. 


2. The Mean Value Theorem. 


In forming our polynomial of approximation we start from 
a mean value theorem analogous to that which we already know 
for functions of one variable. This theorem gives a relation 
between the difference f(z -+ h, y+ k) — f(a, y) and the partial 
derivatives f, and f,. We expressly assume that these derivatives 
are continuous. On applying the ordinary mean value theorem 
to the function F(£) we obtain 


> > 
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where @ is a number between 0 and 1, and from this it follows that 


fet ht y+ M)— FY) _ ape + ht, y + OM) 
t 
+ kf,(x + Oht, y + Ok). 


If we put ¢ = 1 in this, we obtain the required mean value theorem 

for functions of two variables in the form 

Sat h, y+k)—f (a, y)= hfaa+ Oh, y+ Ok) + kf,(a-+ Oh, y+ 8k) 
= hflé, 0) + (E, 2)- 


That is, the difference between the values of the function at the 
points (x +h, y-+k) and (x, y) ts equal to the differential at an 
intermediate point (£,) on the line-segment joining the two points. 
It is worth noting that the same value of @ occurs in both 


fc and fy. 
The following fact, the proof of which we leave to the reader, 


is a simple consequence of the mean value theorem. A function 
f(z, y) whose partial derivatives f, and f, exist and have the 
value 0 at every point of a region is a constant. 


3. Taylor’s Theorem for Severa) Independent Variables. 


If we apply Taylor’s formula with Lagrange’s form of the 
remainder (cf. Vol. I, Chapter VI, p. 324) to the function F(t) 
and finally put ¢ = 1, we obtain Taylor’s theorem for functions of 
two independent variables, 


flat h yt bh=fla, y) + {hfe 9) + Wfla, 9} 
+ 3 {hes (x, y) + 2hlfey(, y) + fy (2 $+ .-- 


+5 {fan @, y) + (7) eee y+. +k fale | 


+ R,, 
where R,, symbolizes the remainder term 


= 1 (n+1) 
Ra= wri eet Oh, y + Ok) + kfy(a + Oh, y + Ok)}"*, 
O<@<1. 


The homogeneous polynomials of degree 1, 2,..., ”, n+ 1, 
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into which the increment f(x + 4, y+ k) — f(a, y) is thus split 
up, apart from the factors 


11 1 1 
TP arn? DP 


are respectively the first, second, ..., n-th differentials 


af = hfe + If, 
af = (hf. et Kf, oo = Tf az a hk. av + Hfyy, 


df = (hfig + lof) = Bl fn +(1) WO ifety bo + hfe 


of f(z, y) at the point (2, y) and the (n+ 1)-th differential 
d*“If at an intermediate point on the line-segment joining (x, y) 
and (c-+h, y+). Hence Taylor’s theorem can be written 


more compactly as 
Seth y+ H=fG Nt Pon +5 He N+. 
1 oe 
+ Fate, y) + Rw me 


where 
1 


(n+ 1)! 


In general the remainder R,, vanishes to a higher order than the 
term df just before it; that is, as h—0O and k—->0 we have 
R, = 0{ V2 + )"}. 

In the case of Taylor’s theorem for functions of one variable 
the passage (n> ©) to infinite Taylor series played an im- 
portant part, leading us to the expansions of many functions in 
power series. With functions of several variables such a process 
is in general too complicated. Here to an even greater degree 
than in the case of functions of one variable we lay the stress 
rather on the fact that by means of Taylor’s theorem the incre- 
ment f(2-+h, y+ k)—f(e, y) of a function is split up into 


increments df, df, .. . of different orders. 


d-1f(e + 0h, y+ 6k), O<0<1. 


no 


ExaMPLas 


1, Find the polynomial of the second degree which best approximates 
to the function sinz siny in the neighbourhood of the origin. 
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2. If f(x, y) is a continuous function with continuous first and second 
derivatives, then 
~1/ahy -Uh 
Saa(0, 0) = lim f(2h, € ) a 2f(h, e—”) + F(0, 0) 
h—>+0 he 


3. Prove that the function e~”**®” can be expanded in a series of the 
form 
$ Hale) 


% 
n=o nh - 


which converges for all values of # and y and that 
(a) H,(x) is a polynomial of degree n (so-called Hermite polynomials). 
(0) H’ p(x) = 2nHy4(). 
(c) Byy, — 2H, + 2nH,_, = 0. 
(d) A”, — 2aH’,, + 2nH,, = 0. 
4, Find the Taylor series for the following functions and indicate their 
range of validity: 


ies ji (er 


l—z 


7. Taz APPLicaTION OF Vector METHODS 


Many facts and relationships in the differential and integral 
calculus of several independent variables take a decidedly 
clearer and simpler form if we apply the ideas and notation of 
vector analysis. We shall accordingly conclude this chapter with 
some discussion of the matter. 


1. Vector Fields and Families of Vectors. 


The step which connects vector analysis with the subjects 
just discussed is as follows. Instead of considering a single 
vector or a finite number of vectors, as in Chapter I (p. 3), we 
investigate a vector manifold depending on one or more con- 
tinuously varying parameters. — 

If, for example, we consider a solid body occupying a portion 
of space and in a state of motion, then at a given instant each 
point of the solid will have a definite velocity, represented by a 
vector #. We say that these vectors form a vector field in the 
region in question. The three components of the field vector 
then appear as three functions 


Uy (2, Zo, Ls), Ug, 22, Xz), Ug(,, Le, Xz) 
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of the three co-ordinates of position, which we here denote by 
(a, %e, Zy) instead of (a, y, 2). 


A case of a velocity field is represented in fig. 8, which shows 


~ 
« 


lee et 


~ = ““y 
2 4 


—r, 


— 


Fig. 8.—The velocity field in a rotation 


the velocity field of a solid body rotating about an axis with 
constant angular velocity. 

The forces acting on the points of a moving solid body likewise 
form a vector field. As an example of a force field we consider 
the attractive force per unit mass exerted by a heavy particle, 
according to Newton’s law of gravitation. By Newton’s law 
all the vectors of this field of force are directed towards the 
attracting particle, and their lengths are inversely proportional 
to the square of the distance from the particle. 

If we pass to a new rectangular co-ordinate system by rotation 
of axes, all the vectors of the field will have new components with 
respect to the new system of axes. If the two co-ordinate systems 
are connected. by equations of the form (Chapter I, section 1, p. 6) 


Ey = 04%, + BX + Wks 


4 = Oy + Bo%e + Vote 
&5 = ah + Bet, + Yate 


y= a4 &, + ay bs + ag&, 
%_ = BE, + Boge + Bsés 
tg = 1b + vege + Yeta 


or 
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respectively, then the relations between the components 1%, Ue, Us 
with respect to the z-system and the components w,(&, &, £3), 
wo, £4, &5), w9(£,, €, €3) with respect to the new §-system are 
given by the equations of transformation 


Wy = Ayt + Bytlg + YyUs 

We = Atl, + Botte + Yells 

Ws = gt, + Bytlg + Yas 
and 

Uy == A401 + Agw2-+ agtg 

Us = Byw,-+ Botdy+ Baws 

Ug = YW, -b YaWe + Yas 


respectively. (Cf. Chap. I, p. 6.) The components a, w,, ws 
in the new system thus arise from the introduction of the new 
variables and the simultaneous transformation of the functions 
representing the components in the old system. 

When in physical applications each point of a portion of space 
has assigned to it a definite value of a function u = f(a, %, Xs), 
such as the density at the point, and we wish to emphasize that 
the property is not a component of a vector, but on the contrary 
is a property which retains the same value although the co- 
ordinate system is altered, we say that the function is a scalar 
function or scalar; or, if we wish to emphasize the association 
between the values of the function and the points of the portion 
of space, we speak of a scalar field. Thus for every vector field 2 
the quantity | 2 |? = u,2 + u,?-+ up? is a scalar; for it represents 
the square of the length of the vector and therefore retains the 
same value independently of the co-ordinate system to which 
the components of the vector are referred. 

In the examples above the vector field z is given us to begin 
with, and its components with respect to any system of rect- 
angular co-ordinates are therefore determined. If, conversely, 
in a definite co-ordinate system, say an z-system, there are given 
three functions (a, 2, £3), Up(%1, Te, Ly), Uy (21, Ta, Xe), these three 
functions define a vector field with respect to that system, the 
components of the field being given by the three functions. To 
obtain the expressions for the components w,, wa, ws in any 
other system we have only to apply the equations of transfor- 
mation deduced above. 
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In addition to vector fields, we also consider manifolds of 
vectors called families of vectors, which do not correspond to each 
point of a region in space, but are functions of a parameter ¢. We 
express this by writing #— w(t). If we think of 2 as a position 
vector measured from the origin of co-ordinates in u,tgus-space, 
then as ¢ varies the final point of this vector describes a curve 
in space given by three parametric equations, 

uy = $f), We= P(t), Ug = x(t). 

Vectors which depend on a parameter ¢ in this way can be 
differentiated with respect to t. By the derivative of a vector 
u(t) we mean the vector 2’(¢) which is obtained by the passage 
to the limit 
lim u(t + » — u(t) 


h—>0 
and which accordingly has the components 


,_ dy, dy», __ Ws 
Hea ane ee 


We see at once that the fundamental rules of differentiation 

hold for vectors. Firstly, it is obvious that if 

w=u+uwv 
then 

w= ut v'. 
Further, the product rule applied to the scalar product 
of two vectors # and v, #0 = Uy, + Ug, + Ugg (cf. p. 7) 
gives 

(ev) 


o = uv'+ wv. 


In the same way we obtain the rule 


[22] ray’ , 
oa lee eel 


for the vector product. 
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2. Application to the Theory of Curves in Space. Resolution of a 
Motion into Tangential and Normal Components. 


We shall now make some simple applications of these ideas. 
If x(t) is a position vector in 2,%%5-space which depends on a 
parameter ¢, and therefore defines a curve in space, the vector 
x’‘(t) will be in the direction of the tangent to the curve at the 
point corresponding tot. For the vector x(¢-+ h)— x(t) is in the 
direction of the line-segment joining the points (¢) and (¢-+ h) 


(cf. fig. 9); therefore so is the vector Er A ee), which differs 


from it only in the factor 1/h. As h—>O the direction of 

this chord approaches the direc- 

(t+hp tion of the tangent. If instead of 

t we introduce as parameter the 

length of the arc of the curve meas- 

ured from a definite starting-point, 

and denote differentiation with 

respect to s by mieans of a dot, we 
can prove that 


dy? ++ Hq? + a2 = 1; 


this may also be written in the form 


Fig. 9.—Differentiation of the position KX= X= 1. 
vector of a curve 


The proof follows exactly the same 
lines as the corresponding proof for plane curves (cf. Vol. I, 
Chap. V, p. 280). The vector % is therefore of unit length. If 
we again differentiate both sides of the equation +2 = 1 with 
respect to s, we obtain 


LX = 0. 


This equation states that the vector # with components 4,(s), 

#,(s), £(s) is perpendicular to the tangent. This vector we call the 

curvature vector or principal normal vector, and its absolute value, 
that is, its length 

1 a is 2 

k= P = VG? + #7 + #57), 


we call the curvature of the curve at the corresponding point. 
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The reciprocal p= 1/k of the curvature we call the radius of . 
curvature, as before. The point obtained by measuring from the 
point on the curve a length p in the direction of the principal 
normal vector is called the centre of curvature. 

We shall show that this definition of the curvature agrees 
with that given in Vol. I, Chap. V (pp. 280-3). For % is a vector 
of unit length. If we think of the vectors +(s + h) and 2(s) as 
measured from a fixed origin, then the difference <(s -+ h) — %(s) 
will be represented, as in fig. 9, by the vector joining the final 
points of the vectors #(s) and 2(s + 4). If ais the angle between 
the vectors <(s) and x(s-+ h), the length of the vector joining 
their final points is 2 sin a/2, since #(s) and 2(s + h) are both of 
unit length. Hence if we divide the length of this vector by a 
and let h > 0, the quotient tends to the limit 1. Consequently 


a 


lim $= lim ; AV { (dy(s + h) — 4(s))? + (als + 2) — ta(s))? 
h—->0 h—0 
+ (a3(s + 4) — dy(s))*}. 


Here the limit on the right is exactly +/(#?+ %7+ #,°). 
But a/h is the ratio of the angle between the directions of 
the tangents at two points of the curve and the length of are 
between those points, and the limit of that ratio is what we have 
previously defined as the curvature of the curve. 

The curvature vector plays an important part in mechanics. 
We suppose that a particle of unit mass moves along a curve 
x(t), where ¢ is the time. The velocity of the motion is then 
given both in magnitude and in direction by the vector x’(f), 
where the dash denotes differentiation with respect to ¢. Similarly, 
the acceleration is given by the vector x’(é). By the chain rule 
we have 

ds 


= KE 
dt 


(where the dot denotes differentiation with respect to s), and also 
vs , . [ds\? 
x= xX ie + # (%*) : 


In view of what we already know about the lengths of # and 
#, this equation expresses the following facts: 
The “acceleration vector” of the motion is the sum of two 
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vectors. One of these is directed along the tangent to the curve, 
and its length is equal to - that is, to the acceleration of the 


point in its path (the tangential acceleration). The other is directed 
towards the centre of curvature, and its length is equal to the 
square of the velocity multiplied by the curvature (the normal 
acceleration). 


3. The Gradient of a Scalar. 


We now return to the consideration of vector fields and shall 
give a brief discussion of certain concepts which frequently arise 
in connexion with them. 

Let u= f(a, 2, %3) be any function defined in a region of 
%Xp%_-space; that is, according to the terminology previously 
adopted, « is a scalar quantity. We may now regard the three 
partial derivatives 


Wy = fis Uy = fey Us = fe, 


in the 2-system as forming the three components of a vector 2. 
If we now pass to a new system of rectangular co-ordinates, the 
é-system, by rotation of axes, the new components of the vector 
“ are given according to the formule of p. 6 by the equations 


Oy = yt + Bytlg + YyUg 

Wg = Agtt, + Pally + Yolly 

Wg = a3t + Botte + yotty. 
On the other hand, if we introduce the rectangular co-ordinates 
£1, 2, €s a8 new independent variables in the function f (24, 22, 23), 
the chain rule gives 


Se = f,.% + f,P1 + eV 
Se, = fin + fBo + f¥2 
Fe, = Fa + SesBa + Sepa 


wy =Sey W, = Sep Ws = fry 


Hence 


and we thus see that in the new co-ordinate system also the 
components of the vector z are given by the partial derivatives 
of the function f with respect to the three co-ordinates. Thus 
to every function f in three-dimensional space there corresponds 
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a definite vector, whose components in any rectangular co- 
ordinate system are given by the three partial derivatives with 
respect to the co-ordinates. We call this vector the gradient of 
the function, and denote it by 


wu = grad f. 


For a function of three variables the gradient is an analogue 
of the derivative for functions of one variable. 

In order to form a graphical idea of the meaning of the 
gradient, we shall form the derivative of the function in the 
direction (a4; @g, ag), Where a, d2, ag are the three angles which 
this direction makes with the axes, so that cos?a,-+ cosa, 
+ cos?a, = 1. For this derivative we have already obtained the 
expression 

Df = fe, 008 a, + fp, C08 Og + fin, COS Og. 


If we think of a vector e of unit length in the direction (a1, ag, a), 
this vector will have components e, = C08 a1, €; = CO8 dg, &s = COS ag. 
Thus for the derivative of the function in the direction (a, ag, a3) 
we obtain the expression . 


Df = e grad f, 


the scalar product of the gradient and the unit vector in the 
direction (a4, a, a3), ie. the projection of the gradient on that 
vector (cf. Chap. I, p. 7). 

It is this fact that accounts for the importance of the concept 
of gradient. If, for example, we wish to find the direction in 
which the value of the function increases or decreases most 
rapidly, we must choose the direction in which the above expres- 
sion has the greatest or least value. This clearly occurs when 
the direction of e is the same as that of the gradient or is exactly 
opposite to it. , 

Thus the direction of the gradient is the direction in which the 
function increases most rapidly, while the direction opposite to that 
of the gradient is that in which the function decreases most rapidly; 
the magnitude of the gradient gives the rate of increase or decrease. 

We shall return to the geometrical interpretation of the 
gradient in Chapter III (p. 124). We can, however, immedi- 
ately give an intuitive idea of the direction of the gradient. If 
in the first instance we confine ourselves to vectors in two dimen- 
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sions, we have to consider the gradient of a function f(z, y). We 
shall suppose that this function is represented by its contour 
lines (or level lines) 


(x, y)=e 


in the ry-plane. Then the derivative of the function f(z, y) in the 
direction (cf. p. 62) of these level lines is obviously zero. For if 
P and Q are two points on the same level line, the equation 
f(P) — f(Q) = 0 holds (the meaning of the symbols is obvious), 
and the equation will still hold if we divide both sides by h, the 
distance between P and Q, and then let h tend to 0. The projec- 
tion of the gradient in the direction of the tangent to the level 
line is therefore zero, and hence at every point the gradient is 
perpendicular to the level line through that point. An exactly 
analogous statement holds for the gradient in three dimensions. 
If we represent the function f(a,, 2, 23) by its level surfaces 


I (%, %, %) = ©, 


the gradient has the component zero in every direction tangent 
to a level surface, and is therefore perpendicular to the level 
surface. 

In applications we frequently meet with vector fields which 
represent the gradient of a scalar function. The gravitational 
field of force may be taken as an example. 

If we denote the co-ordinates of the attracting particle by (&,, Eos Es), 
those of the attracted particle by (2, 2, 73), and their masses by m and 
4M, the components of the force of attraction are given by the expressions 

Cc a es 
V (Er — &)® + (&, — &)® + (Es — w5)*}8 
C bs — &e 
VE — %)? + (&_ — x)? + (Es — 2x5)? 
C 3 — ts : 
V(Es — 24)? + (Be — 29)? + (Eg — %5)?}* 
Here C is a constant with the value ymM, where y is the “ gravitational 
constant”. (The factors 


f— % 


Vii — mF + (& — mF Fe ay & 


are the cosines of the angles which the line through the two points makes 
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with the axes.) By differentiation we see at once that these components 
are the derivatives of the function 


C 
VACA — m,)? + (Eg — 2)? + (& — %3)*} 


with respect to the co-ordinates x,, 2%, a respectively. The force vector 
apart from a constant factor is therefore the gradient of the function 


1 1 
7 VE — m+ (Ep — wa + (Es — )") 
If a field of force is obtained from a scalar function by forming 
the gradient, this scaler function is often called the potential 
function of the field. We shall consider this concept from a 


more general point of view in the study of work and energy 
(Chapter V, p. 350, and Chapter VI, pp. 415, 468-81). 


4. The Divergence and Curl of a Vector Field. 


By differentiation we have assigned to every function or 
scalar a vector field, the gradient. Similarly, by differentiation 
we can assign to every vector field a certain scalar, known as the 
divergence of the vector field. Given a specific co-ordinate system, 
the a-system, we define the divergence of the vector # as the 
function 


Ou, , Oy , Oty 
Sa ee ae 


div «= 

i.e. the sum of the partial derivatives of the three components 
with respect to the corresponding co-ordinates. Suppose now 
that we change the co-ordinate system to the é-system. If the 
divergence is really to be a scalar function associated with the 
vector field and independent of the particular co-ordinate system, 
we must have 
Ow, , Owe , Owe 
api tag tag? 
0g, Of, Es 
where w,, W2, ws are the components of w in the é-system. In 
fact, the truth of the equation 

Ou, , ug , Oly Aw, , Ow, | Cwsy 

on Oe 0, On Pes 
can be verified immediately by applying the chain rule and the 
transformation formule of p. 84. 


div u= 
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Here we content ourselves with the formal definition of the 
divergence; its physico-geometrical interpretation will be dis- 
cussed later (Chapter V, section 5, p. 388). 

We shall adopt the same procedure for the so-called curl * 
of a vector field.. The curl is itself a vector 


y= curl w 
whose components 1, f, 73 are defined by the equations 


1, = Ota __ Oty ry = Oth __ Og te = Olle __ Oly 
rn) nr ar Saar, rs 
In order to show that our definition actually gives a vector 
independent of the particular co-ordinate system, we could verify 
by direct differentiation that the quantities 
py tee By Dery Beng _ ny ey 
a a Sad A Me 
which define the curl in terms of the new co-ordinates, are con- 
nected with the quantities r,, 7,, rs by the equations of transfor- 
mation for vector components. Here, however, we shall omit 
these computations, since in Chapter VI, section 6 (p. 396) we 
shall give a physical interpretation of the curl which clearly 
brings out its vectorial character. 
The three concepts of gradient, divergence, and curl can all 
be related to one another if we use a symbolic vector with the 


components @ ig i 
Oar,’ Om,’ Oxy. 
nablat and is denoted by the symbol V. The gradient of a scalar 
field f(z, %, %), grad f, is the product Vf of the scalar quantity 
f and the symbolic vector V, that is, it is a vector with the com- 
ponents af af af 


The curl of a vector field u(21, X, 2), curl 2, is the vector product 
[Vze] of the vector z and the symbolic vector V; finally, the 
divergence is the scalar product 


This symbolic vector is often called 


i 0 
div sea: Vee = Or 4 On, Oly 
0%, 0%, Oty 
* Often called rotation (with the abbreviation rot). 
t After a Hebrew stringed instrument of similar shape. 
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In conclusion we mention a few relations which constantly 
recur. The curl of a gradient is zero; in symbols 


curl grad f = 0. 


As we easily see, this relation follows from the reversibility of 
the order of differentiation. 
The divergence of a curl is zero; in symbols 


div curl « = 0. 


This also follows directly from the reversibility of the order of 
differentiation. 

The divergence of a gradient is an extremely important expres- 
sion frequently occurring in analysis, notably in the well-known 
“ Laplace’s ” or “ potential equation”. It is the sum of the 
three “ principal ” second-order partial derivatives of a function; 
in symbols 

. ee OF OF OF 
div gad f= Af= 5+ 5+ gp 
where Af is written as an abbreviation for the expression on the 
right.* The symbol 
e a 
~ att ant Tangs 
is called the Laplacian operator. 

Finally, we may mention that the terminology of vector 
analysis is often used in connexion with more than three inde- 
pendent variables; thus a system of n functions of n independent 
variables is sometimes called a vector field in n-dimensional 
space. The concepts of scalar multiplication and of the gradient 
then retain their meanings, but in other respects the state of 
affairs is more complicated than in the case of three dimensions. 


ExaMpPizs 


1. Find the equation of the so-called osculating plane of a curve 
2 = f(t), y = g(t), 2 = A(t) at the point fo, i.e. the limit of the planes passing 
through three points of the curve as these points approach the point with 
parameter f). 

2. Show that the curvature vector and the tangent vector both lie in 
the osculating plane. 


* The notation V?f is also used. 
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3*. Let x = x(s) be an arbitrary curve in space, such that the vector 
x(s) is three times continuously differentiable (s is the length of arc). 
Find the centre of the sphere of closest contact with the curve at the 
point s. 

4. If C is a continuously differentiable closed curve and A a point 
not on C, there is a point B on C which has a shorter distance from A 
than any other point on GC. Prove that the line AB is normal to the 
curve. 

5. If x = x(s) is a curve on a sphere of unit radius, the equation 


holds. 


6. If x = x(t) is any parametric representation of a curve, then the 
a2. 
vector = with initial point x lies in the osculating plane at 2. 


7. The limit of the ratio of the angle between the osculating planes 
at two neighbouring points of a curve and the length of arc between 
these two points, ie. the derivative of the unit normal vector with 
respect to the arc (8), is called the torsion of the curve. Let &,(s), 6,(s) 
denote the unit vectors along the tangent and the curvature vector of the 
curve x(s); by §3(s) we mean the unit vector orthogonal to &, and &, 
(the so-called binormal vector), which is given by [6,6]. Prove Frenet’s 
formulae 


é, = E./e, 
a= —E,/e + &3/t, 
§3 a —€./=, 


where 1/p = & is the curvature and 1/z the torsion of x(s). 

8. Using the vectors §,, &., §; of Ex. 7 as co-ordinate vectors, find 
expressions for (a) the vector x, (b) the vector from the point z to the 
centre of the sphere of closest contact at 2. 

9. Show that a curve of zero torsion is a plane curve. 

10*. Prove that if z= u(x, y) represents the surface formed by the 
tangents of an arbitrary curve, then (a) every osculating plane of the curve 
is a tangent plane to the surface; (b) u(x, y) satisfies the equation 

Upggtlyy — Uny? = 0. 
11. Prove that 
curl curl # = grad div # — Az. 
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Appendix to Chapter II 


1. Tae Princrece of THE Pornt or ACCUMULATION IN SEVERAL 
DIMENSIONS AND ITS APPLICATIONS 


If we wish to refine the concepts of the theory of functions 
of several variables and to establish it on a firm basis, without 
reference to intuition, we proceed in exactly the same way as in 
the case of functions of one variable. It is sufficient to discuss 
these matters in the case of two variables only, since the methods 
are essentially the same for functions of more than two inde- 
pendent variables. 


1. The Principle of the Point of Accumulation. 


We again base our discussion on Bolzano and Weierstrass’s 
principle of the point of accumulation. A pair of numbers (2, y) 
will be called a point P in space of two dimensions, and may be 
represented in the usual way by means of a point with the rect- 
angular co-ordinates x and y in an zy-plane. We now consider 
a bounded infinite set of such points P(x, y); that is, the set 
is to contain an infinite number of points, and all the points are 
to lie in a bounded part of the plane, so that |x| <C and|y|<C, 
where C is a constant. The principle of the point of accumulation 
can then be stated as follows: every bounded infinite set of points 
has at least one point of accumulation. That is, there exists a point 
Q with co-ordinates (€, 7) such that an infinite number of points 
of the given set lie in every neighbourhood of the point Q, say 
in every region 

|o—€1 <3, |y—a] <8 


where 8 is any positive number. Or, in other words, out of the 
infinite set of points we can choose a sequence P,, Ps, Ps,. . . im such 
a way that these points approach a limit point Q. 

_ This principle of the point of accumulation is just as intuitively 
clear for several dimensions as it is for one dimension. It can be 
proved analytically by the method used in the corresponding 
proof in Vol. I (p. 58), merely by substituting rectangular regions 
for the intervals used there. An easier proof can be constructed, 
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however, by using the principle of the point of accumulation 
for one dimension. To do this we notice that by hypothesis every 
point P(x, y) of the set has an abscissa x for which the inequality 
|%|<Cholds. Either there is an x= x, which is the abscissa 
of an infinite number of points P (which therefore lie vertically 
above one another) or else each x belongs only to a finite number 
of points P. In the first case, we fix upon z, and consider the 
infinite number of values of y such that (x, y) belongs to our 
set. These values of y have a point of accumulation 7, by the 
principle of the point of accumulation for one dimension. Hence 
we can find a sequence of values of y, say ¥,, Y2, ..., such that 
Yn > No, from which it follows that the points (x9, y,) of the set 
tend to the limit point (a, 7), which is thus a point of accumu- 
lation of the set. In the second case, there must be an infinite 
number of distinct values of x which are the abscisse of points 
of the set, and we can choose a sequence 2,, 2, ... of these 
abscisse tending to a unique limit €. For each 2, let Pil@ns Yn) 
be a point of the set with abscissa x,. The numbers y, are an 
infinite bounded set of numbers; hence we can choose a sub- 
sequence Yn, Yn, - .. tending to a limit 7. The corresponding 
sub-sequence of abanisase Ln.»Tq,, -.. Still tends to the limit ¢; hence 
the points P,,, P,,, .. . tend to the limit point (£, 7). In either 
case, therefore, we Can find a sequence of points of the set tending 
to a limit point, and the theorem is proved. 

A first and important consequence of the principle of the 
point of accumulation is Cauchy’s convergence test, which can be 
expressed as follows: 

A sequence of points P,, Ps, Ps, . . . with the co-ordinates (x,; y1), 
(Xe, Ye), (Xs, Ys), - - - tends to a limit point of, and only tf, for every 
«> 0 there 1s a suffiz N= N(e) such that the distance between 
the points P, and P,,, V(X, — Xm)? + (Ya — Ym) 18 less than 


e whenever both n and m are greater than N 


2. Some Concepts of the Theory of Sets of Points. 

The general concept of a limit point is fundamental in many 
of the more refined investigations of the foundations of analysis 
based on the theory of sets of points. Although these matters 
are not, essential for most of the purposes of this book, we shall 
mention some of them ‘here for the sake of completeness. 

A bounded set of points, consisting of an infinite number of 
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points, is said to be closed if it contains all its limit points; that 
is, limit points of sequences of points of the set are again points 
of the set. For example, all the points lying on a closed curve 
or surface form a closed set. For functions defined in closed sets 
we can state the two following fundamental theorems: 

A function which is continuous in a bounded closed set of 
points assumes a greatest and a least value in that set. 

A function which is continuous in a bounded closed set is 
uniformly continuous in that set. 

The proofs of these theorems are so like the corresponding 
proofs for functions of one variable that we shall omit them. 

The least upper bound of the distance between the points P, 
and P, for all pairs of points P,, P,, where both points belong to a 
set, is called the diameter of that set. If the set is closed, this 
upper bound will actually be assumed for a pair of points of the 
set. The student will be able to prove this easily, remembering 
that the distance between two points is a continuous function 
of the co-ordinates of the points. 

By using the theorem that a continuous function on a bounded 
closed set does assume its least value, we can readily establish 
the following fact: if a point P does not belong to a closed set M, 
a positive least distance from P to M exists; that is, a point Q of 
M exists such that no point of M has a smaller distance from P 
than Q has. This enables us to show that the closed regions 
defined in section 1 (p. 41) are actually closed sets according 
to the definition here. For let C be a closed curve, and let R be 
the closed region consisting of all points interior to C or on C; 
we have to show that all the limit points of R belong to R. We 
assume the contrary, i.e. that there is a point P not belonging to 
R which is a limit point of R. Then, in particular, P does not lie 
on C; hence by the theorem above it has a positive least distance 
from C (C being a closed set). We can therefore describe a circle 
about P as centre, so small that no point of C lies in the circle; 
we have only to make the radius of the circle less than the 
least distance from P to C. The point P is outside C, since 
otherwise it would belong to R; and since every point in the 
small circle can be joined to P by a line-segment which does 
not cross the curve C, every point of the circle lies outside C, 
and so no point of the circle belongs to R. But we assumed that 
P is a limit point of R, which requires that the circle should 

6 (8912) 
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contain an infinite number of points of R. Hence the assumption 
that there is a limit point of R which does not itself belong to R 
leads to a contradiction, and our assertion is proved. The extension 
to closed regions R bounded by several closed curves is obvious. 

A useful property of closed sets is contained in the theorem 
on shrinking sequences of closed sets: 

If the sets M,, My, Ms, . . . are all closed, and each set is con- 
tained in the preceding one, then there is a point (&, n) which belongs 
to all the sets. 

In each of the sets 1M, let us choose a point P,. The sequence 
P,, must either contain an infinite number of repetitions of some 
one point, or else an infinite number of distinct points. If 
one point P is repeated an infinite number of times, then it 
belongs to all the sets; for if M,, is any one of the sets, P belongs 
to a set M,,, where m, >, and M,, is contained in M,. If 
there are an infinite number of distinct points P,, then by the 
principle of the point of accumulation they possess a point of 
accumulation (&, 7). This point belongs to each M,. For when- 
ever m > n the point P,, belongs to M,, since it is a point of 
M,, which is contained in M,. Hence (€, 7) is a limit point of 
points P,, of M,, and since M, is closed, (£, 7) is a point of M,. 
Thus in either case there exists a point common to all the sets 
M,, and the theorem is proved.* 

A set is said to be open if for every point of the set we can find 
a circle about the point as centre which belongs completely to 
the set. An open set is connected if every pair of points A and B 
of the set can be joined by a broken (polygonal) line which 
lies entirely in the set. 

The word “domain” is often used with the restricted 
meaning of a connected open set. As examples we have the 
interior of a closed curve, or the interior of a circle with the 
points of a radius removed. The points of accumulation of 
a domain which do not themselves belong to the domain are 
called the boundary points. The boundary B of a domain D is a 
closed set. Here we shall sketch the proof of this statement. 


* The assumption that the sets M,, are closed is essential, as the following 
example shows. Let M,, be the set O<2< ‘ Each set is contained in the 
preceding, but no point belongs to all the sets. For if x = 0 the point belongs 
to no set, while if x > 0 it belongs to no set M,, for which 2 <e 
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A point P which is a limit point of B does not belong to D, 
for every point of D lies in a circle composed only of points of 
D and hence devoid of points of B. It is also a limit point of D, 
for arbitrarily close to P we can find a point Q of B, and arbi- 
trarily close to Q we can find points of D. Hence P belongs 
to B. 

If to a domain D we add its boundary points B, we obtain a 
closed set. For every limit point of the combined set is either 
a limit point of B and belongs to B, or is a limit point of D 
and belongs either to D or to B. Such sets are called closed 
regions, and are particularly useful for our purposes. 

Finally, we define a neighbourhood of a point P as any open 
set containing P. If we denote the co-ordinates of P by (€, 7), 
the two simplest examples of neighbourhoods of P are the circular 
neighbourhood, consisting of all points (x, y) such that 


(a — P+ y—nP <8, 
and the square neighbourhood, consisting of all points (x, y) such 


that 
Jva—€]<8 and Jy—7| <8 


3. The Heine-Borel Covering Theorem. 


A further consequence of the principle of the point of accumu- 
lation, which is useful in many proofs and refined investigations, 
is the Heine-Borel covering theorem, which runs as follows: 

If corresponding to every point of a bounded closed set M a 
neighbourhood of the point, say a square or a circle, ts assigned, 
it is possible to choose a finite number of these neighbourhoods in 
such a way that they completely cover M. The last statement of 
course means that every point of M belongs to at least one of 
the finite number of selected neighbourhoods. 

By an indirect method the proof can be derived almost im- 
mediately from the theorem on shrinking closed sets. We suppose 
that the theorem is false. The set M, being bounded, lies in a 
square Q. This square we subdivide into four equal squares. 
For at least one of these four squares, the part of M lying in or 
on the boundary of that square cannot be covered by a finite 
number of the neighbourhoods; for if each of the four parts of 
M could be covered in this way, M itself would be covered. 
This part of M we call M,, and we see at once that M, is closed. 


100 FUNCTIONS OF SEVERAL VARIABLES [Cuap. 


We now subdivide the square containing M, into four equal 
squares. By the same argument, the part M, of M, lying in or on 
the boundary of one of these squares cannot be covered by a 
finite number of the neighbourhoods. Continuing the process, 
we obtain a sequence of closed sets M,, M,, Mg, ..., each en- 
closed in the preceding; each of these is contained in a square 
whose side tends to zero, and none of them can be covered by a 
finite number of the neighbourhoods. By the theorem on shrink- 
ing sequences of closed sets we know that there is a point (£, 7) 
which belongs to all these sets, and hence a fortiori belongs 
to M. To the point (é, 7) there accordingly corresponds one of 
the neighbourhoods, containing a small square about (£, 7). 
But since each M,, contains (£, 7) and is itself contained in a 
square whose side tends to 0 as 1/n does, each M,, after a certain 
n is completely contained in the small square about (£, 7), and 
is therefore covered by one neighbourhood of the set. The assump- 
tion that the theorem is false has therefore led to a contradiction, 
and the theorem is proved. 


ExXAMPLEs 


1. A convex region R may be defined as a bounded and closed region 
with the property that if A, B are any two points belonging to R, all 
points of the segment AB belong to R. Prove the following state- 
ments: 

(a)* If A is a point not belonging to R, there is a straight line 
passing through A which has no point in common with R. 

(b)* Through every point P on the boundary of R there is a straight 
line 7 (a so-called “line of support”) such that all points of R lie on one 
and the same side of J or on / itself. 

(c) If a point A lies on the same side of every line of support as the 
points of R, then A is also a point of R. 

(d@) The centre of mass of RF is a point of R. 

(e) A closed curve forms the boundary of a convex region, provided 
that it has not more than two points in common with any straight line. 

(f)* A closed curve forms the boundary of a convex region, provided 
that its curvature is everywhere positive. (It is assumed that if the 
whole curve is traversed the tangent makes one complete revolution.) 

2. (a) If 8 is an arbitrary closed and bounded set, there is one “ least 
convex envelope” F of S, i.e. a set which 
(1) contains all points of S, 
(2) is contained in all convex sets containing 8, 
(3) is convex. : 
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(6) # may also be described in the following way: 

A point P is in # if, and only if, for every straight line which leaves 
all points of 8 on one and the same side, P is also on this side, 

(c) The centre of mass of 9 is a point of HZ. 


2. Tame Concert or Lunt ror Functions oF 
SEVERAL VARIABLES 


We shall find it useful to refine our conceptions of the various 
limiting processes connected with several variables and to consider 
them from a single point of view. Here we again restrict our- 
selves to the typical case of two variables. 


1. Double Sequences and their Limits. 


In the case of one variable we began with the study of se- 
quences of numbers a,, where the suffix » could be any integer. 
Here double sequences have a corresponding importance. These 
are sets of numbers @,,, with two suffixes, where the suffixes m 
and run through the sequence of all the integers independently 
of one another, so that we have e.g. the numbers 


My. Ayo, Foi, 2g, Ay, Agi, Aya, Aags- ++ + 
Examples of such sequences are the sets of numbers 


1 n 


fe ee Oe m8 Om 


We now make the following statement: 

The double sequence a,,,, converges as n-> © and m-—> © to 
a linut, or more precisely a “double limit”, 1 if the absolute 
difference | a,,,—1| is less than an arbitrarily small pre-assigned 
positive number « whenever n and m are both sufficiently large, that 
ts, whenever they are both larger than a certain number N depend- 
ing only on «. We then write 


Thus, for example, 
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2 
and iy er Dt = 0. 
n—>o mn? n—> ow n? m 
m—>o m—> 0 


Following Cauchy, we can determine, without referring to the 
limit, whether the sequence converges or not, by using the 
following criterion: 

The sequence a,,, converges if, and only if, for every «> 0 
a number N = N(c) exists such that | ag — Apa | < € whenever 
the four suffixes n, m, n’, m’ are all greater than N. 

Many problems in analysis involving several variables depend 
on the resolution of these double limiting processes into two 
successive ordinary limiting processes. In other words, instead 
of allowing n and m to increase simultaneously beyond all bounds, 
we first attempt to keep one of the suffixes, say m, fixed, and let 
n alone tend to ©. The limit thus found (if it exists) will in 
general depend on m; let us say that it has the value l,,. We 
now let m tend to 0. The question now arises whether, and 
if so when, the limit of 1,, is identical with the original double 
limit, and also the question whether we obtain the same result, 
no matter which variable we first allow to increase; that is, 
whether we could have first formed the limit lim a,,, = A, and 


m—->~@o 
then the limit lim A,, and still have obtained the same result 


n—-> © 


We shall begin by gaining a general idea of the position from a 


few examples. In the case of the double sequence a,,., = » when 
n+m 
m is fixed we obviously obtain the result lim a,,,, = 1, = 0, and therefore 
r— wo 
lim J,, = 0; the same result is obtained if we perform the passages to the 


M—pn 3 
limit in the reverse order. For the sequence 


however, we obtain 
lim @ym = by = 1 
n—>o 


and consequently 
lim 1, = 1; 
m—>o 


while on performing the passages to the limit in the reverse order we first 
obtain 
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lim Gay, = Ay = 0 


n> 2 
and then 
lim A, = 0. 
n—>-o 
In this case, then, the result of the successive limiting processes is not 
independent of their order: 


lim (lim a,,,,) + lim (lim a,,,,). 
mo n> n~>@ m—>o 


In addition, if we let and m increase beyond all bounds simultaneously, 
we find that the double limit fails to exist.* 
Another example is given by the sequence 
sinn 


a. =, 
nm 
m 


Here the double limit lim a,,,,, exists and has the value 0, since the nu- 
n—> wo 


n> 
merator of the fraction can never exceed | in absolute value, while thedenomi- 
nator increases beyond all bounds. We obtain the same limit if we first let m 
tend to ©; we find that lim a,,,,—A,=0, so that lim 4,—0. If, how- 
m co) 


> n—>o 
ever, we wish to perform the passages to the limit in the reverse order, keeping 
m fixed and letting n increase beyond all bounds, we encounter the difficulty 


that lim sinn does not exist. Hence the resolution of the double limiting 
a—>-o 

process into two ordinary limiting processes cannot be carried out in both 

ways. 


The position can be summarized by means of two theorems. 
The first of these is as follows: 
If the double limit lim aj, = 1 exists, and the simple limit 


n> o 
m—> 


lim nm = lm, exists for every value of m, then the limit lim 1,, 
n—>@ 


also exists, and lim I,,— 1. Again, if the double limit Guiteend 
m—> 


has the value J, and the limit lim a,,., = i, exists for every value 


m—>o0 
of n, then lim A, also exists and has the value J. In symbols: 
n—->o 


b= lim dyn = lim (lim dym) = lim (lim a 


n—> 0 m—> © n—> © u—> 2 m—> wo 
m—>o 


am)} 


* For if such a limit existed it would necessarily have the value 0, since 
we can make @,,, arbitrarily close to 0 by choosing 7 large enough and choosing 
m = n*. On the other hand, a,,,, = } whenever » = m, no matter how large 
nis. These two facts contradict the assumption that the double limit exists. 
But even when lim (lim a,,,,) = lim (lim a,,,,) the double limit lim By, May 

m—>2 n—>w n> m—>wo n> 

1 m—>o 
i i is sho bh y = 
fail to exist, as is shown by the example a,,,, nad 
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the double limit can be resolved into simple limiting processes 
and this resolution is independent of the order of the simple 
limiting processes. 
The proof follows almost at once from the definition of the 
double limit. In virtue of the existence of lim a,,, = l, for every 
n> @ 
m—> ao 
positive ¢ there isan N = N(c) such that the relation | dam — I|<e 
holds whenever » and m are both larger than N. If we now 
keep m fixed and let n increase beyond all bounds, we find 
that | lim @n_—1t|=|[l,—t| <<. This inequality holds for 
n—>o 


any positive « provided only that m is larger than M(e); in 

other words, it is equivalent to the statement lim (lim Gym) = 1. 
m—>O n—> oO 

The other part of the theorem can be proved in a similar 

way. 

The second theorem is in some respects a converse of the 
first. It gives a sufficient condition for the equivalence of a 
repeated limiting process and a double limit. This theorem 
is based on the concept of uniform convergence, which we define 
as follows: 

The sequence a,,,, converges as n> © to the limit l,, uniformly 
in mn, provided that the limit lim a,,,, = 1,, exists for every m and in 


n-—>o 
addition for every positive « i is possible to find an N = N(e), 
depending on « but not on m, such that |l,, — a_|< € whenever 
n>N. 


Je th a n ao 1 
For example, the sequence @,,,, = —-—— = — — converges 


1 mn+m) m ntm 
uniformly to the limit J, = —, as we see immediately from the estimate 
m 


lL 1 
<-3 
n+m n 


Gam — Pie 
nm m 
we have only to put N = - On the other hand, the condition for uniform 
€ 


™_. For 
m+n 
fixed values of m the equation lim a,,, = 1,, = 0 is always true; but the 

J n— oO 

convergence is not uniform. For if any particular value, say 1/100, is 
assigned to e, then no matter how large a value of n we choose there are 
always values of m for which | dnm— Um| = nm exceeds c. We have 
only to take m= 2n to obtain a,,,, = $, which is a value differing from 
the limit 0 by more than 1/100. 


convergence does not hold in the case of the sequence ay, = 


Ih THE CONCEPT OF LIMIT 105 


We now have the following theorem: 
[f the limit lim a,,, = 1, exists uniformly with respect to m, 


and if further the limit lim I,,—= Leaists, then the double limit li tpn 


m—->o n—-—>0 
exists and has the value 1: mane 
lim (lim ay,,) = lim Gam 
m—>o n—>2o n—> D 
m—>o 


We can then reverse the order of the passages to the limit, provided 
that lim a,,, = A, exists. 


m->o 


By making use of the inequality 
| Gam — b] S| Gam — In| +] bn — #] 


the proof can be carried out just as for the previous theorem, 
and we accordingly leave it to the reader. 


2. Double Limits in the Case of Continuous Variables. 


Tn many cases limiting processes occur in which certain suffixes, 
e.g. %, are integers and increase beyond all bounds, while at the 
same time one or more continuous variables z, y, ... , tend to 
limiting values £, n, .... Other processes involve continuous 
variables only and not suffixes. Our previous discussions apply 
to such cases without essential modification. We point out in the 
first instance that the concept of the limit of a sequence of func- 
tions f,(x) or f,(z, y) a8 m —> 0 can be classified as one of these 
limiting processes. We have already seen (Vol. I, Chap. VIII, 
p. 393—the definition and proofs can be applied unaltered to 
functions of several variables) that if the convergence of the 
sequence /,,(z) is uniform the limit function f(x) is continuous, 
provided that the functions /,,(a) are continuous. This continuity 
gives the equations 


f (é)= lim f (z)= be (im Sa(2)) = ra Sul€) an (lim fulc)), 


which express the reversibility of the order of the passages to the 
limit n — c and a -> €. 


Further examples of the part played by the question of the reversibility 
of the order of passages to the limit have already occurred, e.g. in the 
theorem on the order of partial differentiation, and we shall meet with 

5° (#912) 
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other examples later. Here we mention so the case of the function 


vy 
SG y= aa 


For fixed non-zero values of y we obtain the limit Base fe, y) = —1, while 
for fixed non-zero values of x we have im. f(z y) = vie 1. Thus 


lim (lim f(z, y)) = Ii, (lim f(x, y)) 


ys 0 ze > 


and the order of the passages to the limit is not immaterial. This is of 
course connected with the discontinuity of the function at the origin. 


In conclusion we remark that for continuous variables the 
resolution of a double limit into successive ordinary limiting pro- 
cesses and the reversibility of the order of the passages to the limit 
are controlled by theorems which correspond exactly to those estab- 
lished on p. 103 for double sequences. 


3. Dini’s Theorem on the Uniform Convergence of Monotonic 
Sequences of Functions. 


In many refined analytical investigations it is useful to be 
able to apply a certain general theorem on uniform convergence, 
which we shall state and prove here. We already know (Vol. I, 
p. 387 ef seq.) that a sequence of functions may converge to a 
continuous limit function, even though the convergence is not 
uniform. In an important special case, however, we can conclude 
from the continuity of the limit that the convergence is uniform. 
This is the case in which the sequence of functions is monotonic, 
that is, when for all fixed values of x the value of the function 
fn(x) either increases steadily or decreases steadily as n increases. 
Without loss of generality we may assume that the values increase, 
or do not decrease, monotonically; we can then state the follow- 
ing theorem: 

If in the closed region R the sequence of continuous functions 
£,(x, y) converges to the continuous lamit function f(x, y), and if 
at each point (x, y) of the region the inequality 


FatilZ, y) = fal, y) 


holds, then the convergence is uniform in R. 
The proof is indirect, and is a typical example of the use 
of the principle of the point of accumulation. If the convergence 
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is not uniform, a positive number a will exist such that for arbi- 
trarily large values of n—say for all the values of » belonging 
to the infinite set n,, v.,...—the value of the function at a 
point P,, in the region, f,(P,,), differs from f(P,,) by more than a. 
If we let » run through the sequence of values ”, 2, ..., the 
points P,,, P,,, ... will have at least one point of accumulation 
Q; and since R is closed, Q will belong to R. Now ior every point 
P in R and every whole number p we have 


S(P) = FAP) + BCP), 


where f,(P) and the “ remainder ” R,(P) are continuous functions 
of the point P. In addition, 


R,(P) 2 R,(P), 


whenever n> yp, as we assumed that the sequence increases 
monotonically. In particular, for n > pw the inequality 


R,(Pr) 2R,(P,) 2a 


will hold. If we consider the sub-sequence Pest ao as o34 
of the sequence which tends to the limit point Q, on account of 
the continuity of R, for fixed values of 4 we also have R,(Q)= a. 
Since in this limiting process the suffix n increases beyond all 
bounds, we may take the index » as large as we please, for the 
above inequality holds whenever n > , and in the sequence of 
points P,, tending to Q there are an infinite number of values 
of the suffix m, hence an infinite number of values of n greater 
than p. But the relation R,(Q) = a for all values of » contradicts 
the fact that R,(Q) tends to 0 as » increases. Thus the assump- 
tion that the convergence is non-uniform leads to contradiction, 
and the theorem is proved. 


EXAMPLES 


1. State whether the following limits exist: 


. (logn)? — (log m)* 
(2) baad (logn)? + (log m)?” 
a—>o 


A tann tanm 

O): tn, a 

n—>ol — tanntanm 
m—>o 


(c) lim — z cos —. 
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2. Prove that a function f(z, y) is continuous, if 

(a) when y is fixed f is a continuous function in 2; 

(6) when 2 is fixed f is uniformly continuous in y, in the sense that 
for every < there is a 8, independent of z and y, such that 


| f(z) —f(% y)| Se 


lu —y| S68. 
3. Prove that f(z, y) is continuous at «= 0, y= 0, if the function 
Dt, o) = f(t cosg, ¢ sing) is 
(2) a continuous function of ¢ when ¢ is fixed; 
(6) uniformly continuous in » when ¢ is fixed, so that for every « there 
is a 8, independent of ¢ and 9, such that 


| D(t, 1) — Of, o)| Se 


le. —e|S8. 


4. Prove that the complementary set of a closed set 8 (i.e. the set of 
all points not in 8) is an open set. 


when 


when 


3. Homogenzovus Functions 


We finally touch on one other special point, the theory of 
homogeneous functions. The simplest homogeneous functions 
occurring in analysis and its applications are the homogeneous 
polynomials in several variables. We say that a function of the 
form az + by is a homogeneous function of the first degree in 
x and y, that a function of the form az? + bay + cy* is a homo- 
geneous function of the second degree, and in general that a 
polynomial in x and y (or in a greater number of variables) is a 
homogeneous function of degree h if in each term the sum of the 
indices of the independent variables is equal to h, that is, if the 
terms (apart from constant coefficients) are of the form 2", 
gh-ly, gh-2y2, . . , yt, These homogeneous polynomials have the 
property that the equation 


Sf (tz, ty) = Uf (zx, y) 


holds for every value of ¢ We now say in general that a 
function f(x, y, ...) ts homogeneous of degree h if tt satisfies 


the equation 
S(t, ty,.. )= Of Yy,. +) 


IJ] HOMOGENEOUS FUNCTIONS 109 


Examples of homogeneous functions which are not polynomials are 
tan(), (= 0) 
ze, 
asin” + ya + Plog + ¥, (k= 2). 
i] 


Another example is the cosine of the angle between two vectors with the 
respective components 2, y, z and u, v, w: 
oe es ee 
Vetta Yul+ oF + w? 
The length of the vector with components 2, y, 2, 
Vet y + a 


is an example of a function which.is posttively homogeneous and of the 
first degree; that is, the equation defining homogeneous functions does 
not hold for this function unless ¢ is positive or zero. 


Homogeneous functions which are also differentiable satisfy 
the characteristic Euler’s relation 


fet yfytofet...=Af(s.y,z,...). 


To prove this we differentiate both sides of the equation 
f(tz, ty, ...)= f(x, y, ...) with respect to ¢ this is per- 
missible, since the equation is an identity int. Applying the 
chain rule to the function on the left, we obtain 


af,(tx, ty,...) + yfi(ta, ty,...) +... hf (a, y,...). 


If we substitute ¢ == 1 in this, the statement follows. 

Conversely, it is easy to show that not only is the validity of 
Kuler’s relation merely a consequence of the homogeneity of the 
function f(x, y, ...), but also the homogeneity of the function 
is a consequence of Euler’s relation, so that Huler’s relation is 
@ necessary and sufficient condition for the homogeneity of the 
function. The fact that a function is homogeneous of degree h 
can also be expressed by saying that the value of the function 


divided by 2* depends only on the ratios y/z, z/z,.... It is 
therefore sufficient to show that it follows from the Euler 
relation that if new variables = 2, 7 = 4 C= *, +. are 


introduced, the function 
alley % j= al(é n€, cé, oe -)= 9(€, ” te. -) 
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no longer depends on the variable €, i.e. that the equation g; = 0 
is an identity. In order to prove this we use the chain rule: 
1 h 
9= (fet aht-- ) aT gal 


1 h 
= (fet Wt. oa sal 


The expression on the right vanishes in virtue of Euler’s relation, 
and our statement is proved. 

This last statement can also be proved in a more elegant but 
less direct way. We wish to show that from Euler’s relation it 
follows that the function 

g(t) = Of (a, y, .. .) —f (ta, ty, ...) 
has the value 0 for all values of ¢. It is obvious that g(1) = 0. 
Again, 
g(t) = hf (aw, y, ...) — afta, ty, ...) — yf (ta, ty,...)— eee 
On applying Euler’s relation to the arguments tx, ty,... we 
find that 


h 
af (tx, ty,...) + yf, (te, ty,... +...= yf e ty, «+ +), 
and thus g(t) satisfies the differential equation 


g(t) = 9) >. 


If we write g(t) = y(t)t* we obtain g’(é) = ow) + ty'(é), so that 
y(t) satisfies the differential equation : 


thy'(é) = 0, 
which has the unique solution y = const. =. Since for ¢= 1 


it is obvious that y(t) = 0, the constant ¢ is 0, and so g(t) = 0 
for all values of ¢, as was to be proved. 


EXxaMP_irs 


1. Prove that if f(z, y, z,...) is a homogeneous function of degree h, 
any k-th derivative of f is a homogeneous function of degree 4 — k. 


2. Prove that for a homogeneous function f of the first degree 
X fre +P Say + Phage Ht 2H fey +... = 0. 


CHAPTER III 


Developments and Applications of the 
Differential Calculus 


1. Imericir Functions 


1. General Remarks. 


In analytical geometry it frequently happens that the equation 
of a curve is given, not in the form y= f(z), but in the form 
F(z, y)== 9. Accordingly, a straight line may be represented 
by the equation ax + by + ¢= 0, or an ellipse by the equation 
x/a* + y?/b2 = 1. To obtain the equation of the curve in the 
form y = f(x) we must “solve ” the equation F(z, y) = 0 for y. 

Again, in Vol. I we considered the problem of finding the 
inverse function of a function y= (f(z), in other words, the 
problem of solving the equation F(x, y) = y — f(x) = 0 for the 
variable z. These examples suggest the importance of studying 
the notion of solving an equation F(a, y)= 0 for « or for y. 
We shall now proceed to this investigation, and in section 3 
(p. 153) we shall extend the results to functions of several variables. 

In the simplest cases, such as the equations mentioned above, 
the solution can readily be found in terms of elementary func- 
tions. In other cases the solution can be approximated to as 
closely as we desire. For many purposes, however, it is preferable 
not to work with the solved form of the equation or with these 
approximations, but instead to draw conclusions about the 
solution by studying the function F(z, y) itself, in which neither 
of the variables x, y is given preference over the other. 

The idea that every function F(z, y) yields a function y = f(z) 
or = ¢(y) given implicitly by means of the equation F(z, y)=0 
is erroneous. On the contrary, it is easy to give examples of 


functions F(z, y) which, when equated to zero, permit of no 
111 
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solution in terms of functions of one variable. Thus, for example, 
the equation 2? + y? = 0 is satisfied by the single pair of values 
x= 0, y= 0 only, while the equation 2+ y?+ 1= 0 is satis- 
fied by no (real) values at all. It is therefore necessary to in- 
vestigate the matter more closely in order to find out whether 
an equation F(z, y) = 0 defines a function y= f(z), and what 
are the properties of this function. 


2. Geometrical Interpretation.* 


In order to clarify the situation we think of the function 
u= F(x, y) as represented by a surface in three-dimensional 
space. The solutions of the 
equation F(x, y) = 0 are the 
same as the simultaneous 
solutions of the two equa- 
tions u = F(x, y) and u= 0. 
Geometrically, our problem 
is to find whether curves 
y = fle) or 2 = gly) exist in 
which the surface u = F(z, y) 
intersects the zy-plane. (How 
far such a curve of inter- 
section may extend does not 
concern us here.) 

A first possibility is that 

Fig. 1.—The surface u = xy the surface and the plane 

may have no point in com- 

mon. For example, the paraboloid u= F(z,y)= 2+y¥+1 
lies entirely above the ay-plane. In such a case there is 
obviously no curve of intersection. We therefore need only 
consider cases in which there is a point (2, Yo) at which 
F(a, Yo) = 0; the values 2, Yo are called an “ initial solution ”. 

If an initial solution exists, two possibilities remain. Either 
the tangent plane at the point (x9, yo) is horizontal or it is not. 
If it is, we can readily show by means of examples that the 
solution y = f(x) or = ¢{y) may fail to exist. For example, 
the paraboloid u = 2? + y? has the initial solution z = 0, y = 0, 
but has no other point in the xy-plane. Again, the surface 
u= ay has the initial solution z=0, y= 0, and in fact 

* Cf, also Vol. I, Chap. X, section 5 (pp. 481-5). 
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intersects the ay-plane along the lines = 0 and y=0 (cf. 
figs. 1, 2). But in no neighbourhood of the origin can we represent 
the whole intersection by a function y= f(z) or by a function 
a= ¢(y). On the other hand, it is quite possible for the equation 
F(x, y) = 0 to have a solution, even when the tangent plane. at 
the initial solution is horizontal, as, for example, in the case 
(y — x)= 0. In the (exceptional) case of a horizontal tangent 
plane, therefore, no definite general statement can be made. 

The remaining possibility is that at the initial solution the 
tangent plane is not horizontal. Then intuition tells us, roughly 
speaking, that the surface u= F(z, y) cannot bend fast enough 


Fig. 2.—Contour lines of u = xy 


to avoid cutting the vy-plane near (29, Yo) in a single well-defined 
curve of intersection, and that a portion of the curve near the 
initial solution can be represented by the equation y= f(x) or 
a= ¢(y). The statement that the tangent plane is not horizontal 
is the same as the statement that F',(%», yo) and F(x», yo) are 
not both zero. This is the case which we shall discuss analytically 
in the next sub-section. 


3. The Theorem of Implicit Functions. 


The general theorem which states sufficient conditions for the 
existence of implicit functions and at the same time gives a rule 
for differentiating them is as follows: 
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If F(x, y) has continuous derivatives F,, and F,, and if at the 
point (Xo, Yo) within its region of definition the equation F(X, Yo) = 0 
as satisfied, while F(X, yy) is not zero, then we can mark off about 
the point (Xo, Yo) a rectangle x, <= x <x, y, Sy Syp such that 
for every x in the interval x, < x S x, the equation F(x, y)=0 
determines exactly one value y = (x) lying in the interval y, < y 
Sye. This function satisfies the equation y= f(x,), and for 
every x in the interval the equation 


F(z, f(a)) = 0 


is satisfeed. The function f(x) is continuous and differentiable, 
and its derivative and differential are given by the equations 


ar eee = — _ -e 
y =f'(z)= 7 dy = df (x) = a 


respectively. 

We shall assume for the present that the first part of the 
theorem, relating to the existence and continuity of the implicitly- 
defined function, is already proved, and shall confine ourselves to 
proving the differentiability of the function and the differentiation 
formule; the proof of the existence and continuity of the solution 
we shall postpone to sub-section 6 (p. 119). 

If we could differentiate the terms of the equation F(a, f(x))=0 
by the chain rule, the above equation would follow at once.* 
Since, however, the differentiability of f(x) must first be proved, 
we must consider the matter in somewhat greater detail. 

As the derivatives F, and F,, have been assumed continuous, 
the function F(x, y) is differentiable. We can therefore write 

Fath, y+k)= F(z, Y) thE (a, y) + kF (a, y)+eh+ ek, 
where ¢, and ¢, are two quantities which tend to zero as h and k 
do or as p = +4/(h? + k?) does. We now confine our attention to 
pairs of values (x, y) and (w+ h, y+ k) for which both x and 
x -+ h lie in the interval 2, < x S 2, and for which y = f(z) and 


y+k=f(e+h). For such pairs of values we have F(z, y) = 0 
and F(e + h, y+ k) = 0, so that the preceding equation reduces 
to 


O= hF, 4+ kFy + qh-+ ek. 
We assume that f(x) has been proved continuous. Hence as 
* Cf. Vol. I, p. 483. 
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tends to 0, so does &, and with them ¢, and ¢, also tend to 0. If 
we divide by hF, (which by hypothesis is not zero), the last 


equation gives 
kF € 
y £)\84 224 1 —0, 
C+ Et EtE 


and on performing the passage to the limit h > 0 we have 


a ee 
ok 
But 
k_fle+™— Fa), 
A r) . 


this proves the differentiability of f(x) and gives the required rule 
for differentiation, 


1 Yim LEAD —fO — jim k_Fs 

carne h moar 
We can also write this rule in the form 
F,4+ F,y =0 


or 
dF = F,dx+ F,dy=0. 


This last equation states that in virtue of the equation F(z, y) = 0 
the differentials dx and dy cannot be chosen independently of 
one another. 

An implicit function can usually be differentiated more easily 
by using this rule than by first writing down the explicit form of the 
function. The rule can be used whenever the explicit representation 
of the function is theoretically possible according to the theorem 
of implicit functions, even in cases where the practical solution 
in terms of the ordinary functions (rational functions, trigono- 
metric functions, &c.) is extremely complicated or impossible. 

Suppose that the second order partial derivatives of F(z, y) 


exist and are continuous. In the equation y’ = — F? whose 
v 
right-hand side is a compound function of x, we can differentiate 
according to the chain rule and then substitute for y’ its value 
F, ee 
— —*. This gives 
F gi 


116 DEVELOPMENTS AND APPLICATIONS [Cuap. 


y= — Pog By} — Way PoP y + Py Pd 
F? 


as the formula for the second derivative of y = f(z). 
In the same way we can obtain the higher derivatives of 
J(x) by repeated differentiation. 


4, Examples. 
1. For the function y = f(x) obtained from the equation of the circle 
F(a, y) = P+ y2—-1=0 


we obtain the derivative 


This can easily be verified directly. If we solve for y, the equation 
of the circle gives either the function y= V(1— 2?) or the function 
y= —V(1— 2*), representing the upper and lower semicircles respec- 
tively. In the first caso differentiation gives 


? Fo 
y= — 

/(1 — 2*) 
and in the second case 


bk x 
oie 
Thus in both cases y/ = — Zs 


y 
2. In the case of the lemniscate (Vol. 1, p. 72) 
F(a, y) = (2 + y?? — 2a%(2? — y*) = 0 


it is not easy to solve for y. For «= 0, y = 0 we obtain F = 0, F, = 0, 
F, = 0. Here our theorem fails, as might be expected from the fact that 
two different branches of the lemniscate pass through the origin. For all 
points of the curve for which y + 0, however, our rule applies, and the 
derivative of the function y = f(z) is given by 


Fy, 4a(x*2+ y*) — 40x 

Fy dy (a8 + y) + dary 
We can obtain important information about the curve from this equation, 
without bringing in the explicit expression for y. For example, maxima 
or minima may occur where y’ = 0, that is, for z= Q or for 2* + y* = a?, 


From the equation of the lemniscate, y = 0 when 2 = 0; but at the origin 
there is no extreme value (cf. fig. 26, Vol. I, p. 72). The two equations 


therefore give the four points (+3 v3, +5 as the maxima and minima. 
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3. In the case of the folium of Descartes 
F(z, y) = 8 + y° — 3azy = 0 
(cf. fig. 3), the explicit solution would be exceedingly inconvenient. At the 


Fig. 3.—Folium of Descartes 


origin, where the curve intersects itself, our rule again fails, since at that 
point F = F, = F,= 0. For all points at which y* + ax we have 
y= F, = a — ay 


F, y? — ax 


Accordingly, there is a zero of the derivative when 27 — ay = 0, or, if we 
use the equation of the curve, when 


w= ar/2, y= arv/4. 


5. The Theorem of Implicit Functions for more than Two Inde- 
pendent Variables. 


The general theorem of implicit functions can be extended to 
the case of several independent variables as follows: 

Let F(x, y, . . . , Z, u) be a continuous function of the independent 
variables x, y,..., Z, u, and let t possess continuous partial 
derivatives F,, F,,..., F,, Fy. For the system of values Xo, Yo, -- +; 
Zg) Ug corresponding to an interior point of the region of definition 
of F, let F(Xo, Yo; « - + » Zq, Ug) = 0 and 


Fy (Zo, Yor - «+ > 2, Ug) + O. 


Then we can mark off an interval u, Su Su, about uy and a 
region R containing (Xp, Yo, --- » Zo) im ts interior such that for 
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every (X, y,..., Z) im R the equation F(x, y, ... , z, u) = 0 is 
satisfied by exactly one value of u in the interval u, Sus Uy. 
For this value of u, which we denote by u = f(x, y,..., 2), the 
equation 
F(a, y, 26+, % f(t, Y)++-+52))=0 
holds identically in R; in addition, 
Ug =f (xo, Yo ee ey 2). 


The function f ts a continuous function of the independent variables 


X,Y, +-., 2, and possesses continuous partial derivatives given by 
the equations 

Ff at F ul 2 0, 

PF yt Ff. wl y= 9, 


F,+ Ff, =0. 

For the proof of the existence and continuity of f(a, y, . . . , z) 
we refer the reader to the next sub-section (p. 121). The formule 
of differentiation follow from those for the case of one independent 
variable, since we can e.g. let y,..., 2 remain constant and thus 
find the formula for f,. 


If we wish, we can combine our differentiation formule in 
the single equation 


Fide+ Fydy+...+F,dz+ Py,du=0. 
In words: 


If in a function F(x, y, ..., 2, u) the variables are not inde- 
pendent of one another, but are subject to the condition F = 0, 
then the linear parts of the increments of these variables are likewise 
not independent of one another, but are connected by the condition 
dF = 0, that is, by the linear equation 


F,de + Fidy +...4+ F,dz-+ F,du=0. 


If we here replace du by the expression u,dx+ uydy+... 
+ u,dz and then equate the coefficient of each of the mutually 
independent differentials dx, dy, ... , dz to zero, we again obtain 
the above differentiation formule. 

Incidentally, the concept of implicit functions enables us 
to give a general definition of the concept of an algebraic function. 
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_ We say that u = f(z, y, . . .) is an algebraic function of the inde- 
pendent variables x, y, ... if w can be defined implicitly by an 
equation F(z, y,..., u)= 0, where F is a polynomial in the 
arguments 2, y, ..., u; briefly, if u “satisfies an algebraic 
equation”. All functions which do not satisfy an algebraic 
equation are called transcendental. 


As an example of our differentiation formuls we consider the equation 
of the sphere, 
2+ty+t+w—1=0. 
For the partial derivatives we obtain 
z 
Uy =~ ->» Uy = g 


and by further differentiation 


Se Bie ee a2 + ut 
ia — te 
2 ay 
May = y= a? 
Rs RR fa 
Uy = — ~ + ty = — " 
uw us 


6. Proof of the Existence and Continuity of the Implicit Functions. 


Although in many special cases the existence and continuity 
of implicit functions follows from the fact that the equation 
F(x, y) = 0 can actually be solved in terms of the usual functions 
by means of some special device, yet it is still necessary to give 
a general analytical proof of the existence theorem stated above. 

As a first step we mark out a rectangle 7, <7 2,,y,S yy 
in which the equation F(x, y) = 0 determines a unique function 
y = f(z). We shall make no attempt to find the largest rectangle 
of this type; we only wish to show that such a rectangle exists. 

Since F(z, y) is continuous and F,(z9, Yo) + 0, we can find 
a rectangle R, with the point P(x», yg) as centre, so small that in 
the whole of R the function F', remains different from zero and 
thus is always of the same sign. Without loss of generality we 
can assume that this sign is positive, so that F, is positive every- 
where in R; otherwise, we should merely have to replace the 
function F by —/, which leaves the equation #(z, y)= 0 un- 
altered. Since F, > 0 on every line-segment a = const. parallel 
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to the y-axis and lying in R, the function F(x, y), considered as a 
function of y alone, is monotonic increasing. But F(x; Yo) = 0; 
hence if A is a point of R with co-ordinates x, and Y1 (Yr < Yo) 
on the vertical line through P (ef. fig. 4), the value of the 
function at A, F(2», y,), is negative, while at the point B with co- 
ordinates % and ys (yg > yo) the value of the function, F(a», ¥2), 
is positive. Owing to the con- 
tinuity of F(a, y), it follows 
that F(x, y) has negative values 
along a certain horizontal line- 
segment y = y, through A and 
lying in R, and has positive 
values along a line-segment 
y= y, through B and lying in 
R. We can therefore mark off 
an interval 2, <2 <2, about 
. % so small that for values of z in that interval the function 
F(z, y) remains negative along the horizontal through A and 
positive along the horizontal through B. In other words, 
for 2, Sz 2, the inequalities F(z, y,) <0 and F(z, y.) > 0 
hold. 

We now suppose that x is fixed at any value in the interval 
% StS, and let y increase from y, to yo. The point (2, y) 
then remains in the rectangle 


Fig. 4 


%StSm® 4 SYSY, 


which we assume to be completely within R. Since F(z, y)>0, 
the value of the function F(z, y) increases monotonically and 
continuously from a negative to a positive value, and can never 
have the same value for two points with the same abscissa. 
Hence for each value of x in the interval 7, Sx < 2, there is a 
uniquely determined * value of y for which the equation F(a, y) = 0 
is satisfied. This value of y is thus a function of 2; we have 
accordingly proved the existence and the uniqueness of the 
solution of the equation F(x, y) = 0. At the same time the part 
played by the condition F, + 0 has been clearly brought out. 


* If the restriction y, < y S y, is omitted, this will not necessarily remain 
true. For example, let F be z? + y? — 1 and let 2) = 0, yy = 1. Then for 
—} SS } there is just one solution, y = f(x), in the interval 0 Sy S 2; but 
if y is unrestricted, there are two solutions, y = V(1 — 2?) andy = — V(1 — 2°), 
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If this condition were not fulfilled, the values of the function at A 
and at B might not have opposite signs, so that F(z, y) need not 
pass through zero on vertical line-segments. Or, if the signs 
at A and at B were different, the derivative F, could change 
sign, so that for a fixed value of x the function F(x, y) would 
not increase monotonically with y and might assume the value 
zero more than once, thus destroying the uniqueness of the 
solution. 

This proof merely tells us that the function y= F(z) exists. 
It is a typical case of a pure “ existence theorem ”, in which the 
practical possibility of calculating the solution does not come 
under consideration at all.* 

The continuity of the function f(z) follows almost at once from 
the above considerations. Let R(z,’S2S%,', m4’ SyS%’) 
be a rectangle lying entirely within the rectangle 7, Sa S 4, 
¥, YS Yq found above. For this smaller rectangle we can 
carry out exactly the same process as before in order to obtain 
a solution y= f(x) of the equation F(x, y)= 0. In the larger 
rectangle, however, this solution was uniquely determined; hence 
the newly-found function f(x) is the same as the old one. If we 
now wish e.g. to prove the continuity of the function f(x) at the 
point x = x9, we must show that for any small positive number « 
| f(x) —f (xo) | < , provided only that 2 lies sufficiently near the 
point %). For this purpose we put 


Yi =Yote and yy’ =Yo—s 


and for these values y,' and y,’ we determine the corresponding 
z-interval x,'< 2S a,'. Then by the above construction, for 
each x in this interval the corresponding f(x) lies between the 
bounds y,’ and y,’, and therefore differs from y, by less than «. 
This expresses the continuity of f(x) at the point x). Since we 
can apply the above argument to any point « in the interval 
ty SU < Xp, we have proved that the function is continuous at 
each point of this interval. 

The proof of the general theorem for F(x, Y, 2-55 % %); 
a function with a greater number of independent variables, 
follows exactly the same lines as the proof just completed, 
and offers no further difficulties. 


* The sacrifice of the statement of such practical methods in a general proof 
is sometimes an essential step towards the simplification of proofs. 
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ExaMP.Les 
1. Prove that the following equations have unique solutions for y near 
the points indicated: 
(a) @+ay+y=7 (2,1). 
(b) x coszy = 0 (1, 7/2). 
(c) zy+ logay=1 = (1, 1). 
(2) B+ y5 + xy=3 (1,1). 
2. Find the first derivatives of the solutions in Ex. 1. 
3. Find the second derivatives of the solutions in Ex. 1. 
4. Find the maximum and minimum values of the function y = f(x) 
defined by the equation 2? + xy + y* = 27. 


5. Show that the equation z-++ y+ z= sinayz can be solved for z 
near (0, 0, 0). Find the partial derivatives of the solution. 


2. CurvES aND Surraces 1n Impuicir Form 


1. Plane Curves in Implicit Form. 


We have previously expressed plane curves in the form 
y = f(z), which is unsymmetrical, giving the preference to one 
of the co-ordinates. The tangent and the normal to the curve 
are found to be given by the equations 


(n— 9) — (§—a)f'(a) =0 
(n— y)f'(@) + (€-2)=0 


respectively, where £ and 7 are the current co-ordinates of the 
tangent and the normal, and 2 and y are the co-ordinates of 
the point of the curve. We have also found an expression for 
the curvature, and criteria for points of inflection (Vol. I, 
Chap. V). We shall now obtain the corresponding formule 
for curves which are represented implicitly by equations of the 
type F(x, y)= 0. We do this under the assumption that at 
the point in question F, and F, are not both zero, so that 
FA+ Fj? + 0. 

If we suppose that F, + 0, say, we can substitute for y’ in 
the equation of the tangent at the point (z, y) of the curve its 
value —F,/F,, and at once obtain the equation of the tangent 
in the form 


and 


(§—2)F,+ (n—y)F,=0. 
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Similarly, for the normal we have 
(€— x) F, — (yn — y)F, = 9. 


Without going out of our way to use the explicit form of the 
equation of the curve, we can also obtain the equation of the 
tangent directly in the following way. If a and b are any two 
constants, the equation 


a(é — 2) + b(n — y) = 90 


with current co-ordinates € and 7 represents a straight line 
passing through the point P(a, y). If now P is any point of the 
curve, i.e. if F(z, y) = 0, we wish to find the line through P with 
the property that if P, is a point of the curve with co-ordinates 
a, =2-+h and y,=y+h, the distance from the line to P, 
tends to zero to a higher order than p = +/(h? + k*). In virtue 
of the differentiability of the function F we can write 


F(e +h, y +b) = Fla, y) + hFe + kFy + ep, 


where p tends to 0 as « does. Since the two points P and P, 
both lie on the curve, this equation reduces to hF, + kF, = —ep. 
As we have assumed that F,?-++ F,? + 0, we can write this last 
in the form 


F F 
h—_—__* —_ + k ____ _—* —, = <p, 
Veerr sa!“ vre+RAy 
where €,= — ____£ ____ also tends to zero as p does. If we 
EP FY) . 
write @ = d b= I, the left-hand 


VEE+EA VEE 
side of this equation may be regarded as the expression obtained 
when we substitute the co-ordinates of the point (%,= «+ h, 
t= y +h) for € and 7 in the canonical form of the equation 
of the line, a(€ — 2) + b(n — y) = 0. This is the distance of 
the point P, from the line. Thus the distance of P, from the 
line is numerically equal to | ¢,p|, which vanishes as p does to 
a higher order than p. The equation 


F, F, 
yesera?— 9+ Geeta" 


FAgé— 2) + Fyn —y) =9 


—y)=0 


or 
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is the same as the equation of the tangent found in the preceding 
paragraph. We can therefore regard the tangent at P as that 
line * whose distance from neighbouring points P, of the curve 
vanishes to a higher order than the distance PP,. 

The direction cosines of the normal to the curve are given by 
the two equations 
S| eee ee ee 
V(F* + F,?) V(F e+ F,)’ 
which represent the components of a unit vector in the direction 
of the normal; that is, of a vector with length 1 in the direction 
of the normal at the point P(z, y) of the curve. 

The direction cosines of the tangent at the point P(a, y) are 
given by 


cosa = 


cosp = bane Was sinp = Stic s eee 
V(F.P + F,?) V(F2 + F,?) 


More generally, if instead of the curve F(x, y) = 0 we 
consider the curve 
F(a, y) =, 


where ¢ is any constant, everything in the above discussion 
remains unchanged. We have only to replace the function 
F(x, y) by F(z, y)—c, which has the same derivatives as 
the original function. Thus for these curves the equation 
of the tangent and the normal have exactly the same forms 
as above. 

The class of all the curves which we obtain when we allow 
c to range through all the values in an interval is called a family 
of curves. The plane vector with components F, and F,, which 
is the gradient of the function F(z, y), is at each point of the plane 
perpendicular to the curve of the family passing through that point, 
as we have already seen on p. 90. This again yields the equation 
of the tangent. For the vector with components (¢— x) and 
(y — y) in the direction of the tangent must be perpendicular to 
the gradient, so that the scalar product 


(€—2)F,+ (n—y)F, 
must vanish. 


* The reader will find it easy to prove for himself that two such lines can- 
not exist, so that our condition determines the tangent uniquely. 
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While we have taken the positive sign for the square root 
occurring in the above formule, we could equally well have 
taken the negative root. This arbitrariness corresponds to the 
fact that we can call the direction towards either side of the 
curve the positive direction at will. We shall continue to choose 
the positive square root and thereby fix a definite direction of the 
normal. It is, however, to be observed that if we replace the 
function F(x, y) by —F(a, y) this direction is reversed, although 
the geometrical nature of the curve is unaffected. (As regards 
the sign of the normal, cf. Chap. V, section 2 (pp. 363-4)). 

We have already seen (Vol. I, p. 159) that for a curve ex- 
plicitly represented in the form y = f(x) the condition f”(z) = 0 
is a necessary condition for the occurrence of a point of inflection. 
If we replace this expression by its equivalent, 

”" Fae Fy? — 2FeyF Py + Fv Po 
f (x) aa F,3 ? 
we obtain the equation 


Poe F,? — 2F Fe PF, + Fy Fa? = 0 


aS a necessary condition for the occurrence of a point of inflec- 
tion. In this condition there is no longer any preference given 
to either of the two variables z, y. It has a completely sym- 
metrical character and no longer depends on the assumption 
that F, + 0. 

If we substitute for y’ and y” in the formula for the curvature 
found previously (Vol. I, p. 281) 


ae aeaee 
V(L+ y'?)8 
we obtain the formula 
Fan F QProy Pe ky + Fy, FF 


be Ret Fa 


which is likewise perfectly symmetrical.* For the co-ordinates 
(é, ) of the centre of curvature we obtain the expressions 


— F, 
bat PRL Fay 


* For the sign of the curvature cf. Vol. I, p. 282. 
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F 
ay y ; 
TOON OEE + FP 
where 
= 81 
P= 7 


If the two curves F(x, y) = 0 and G(z, y) = 0 intersect one 
another at the point with co-ordinates x, y, the angle between 
the curves is defined as the angle w formed by their tangents (or 
normals) at the point of intersection. If we recall the ex- 
pressions given above for the direction cosines of the normals 
and the formula for the scalar product (Chap. I, section 1, 
p. 8), we obtain the expression 


FG, + FyGy 
VEE + BP) V (GP + G7) 


for the cosine of this angle. Since we have taken the positive 
square roots here, the cosine is uniquely determined; this corre- 
sponds to the fact that we have thereby chosen definite directions 
for the normals and have thus determined the angle between 
them uniquely. 

By putting w= 7/2 in the last formula we obtain the 
condition for orthogonality, i.e. that the curves intersect at right 
angles, 


cosw = 


FiGz + Fy Gy = 0. 


It the curves are to touch one another, the ratio of the dif- 
ferentials, dy: dx, must be the same for the two curves. That 
is, the condition 

dy: da= —F,: Fy = —G,:G, 


must be fulfilled. This may also be written in the form 
F,G, — F,G, = 0. 
As an example we consider the parabolas 
= Ps 
y — 2p (« + 2 =0 


(cf. fig. 9, p. 137), all of which have the origin as focus (“ confocal” 
parabolas). If », > 0 and p, < 0, the two parabolas 


F=y— 2%,(2+?!)=0 and G= y*— %, (2+ B) =o 
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intersect one another, and at the intersection they are at right angles to 
one another, for 
2 pol as pyG 
F Gz + FyGy = 40172. + 44 = 44 ——— = 0, 
Pe— Pi 
since 
F=G=0, p—-p + 90. 


As a second example we consider the ellipse 


The equation of the tangent at the point (x, y) is 


(&— 25 +(a—9) B= 0 
or 
x y _ 


as we know from analytical geometry. 
We find that the curvature is 


ae atbt 

(aty? + 642)" 
If a > b, this has its greatest value a/b? at the vertices y = 0, x = +a. 
Its least value b/a® occurs at the other vertices x = 0, y = 1b. 


2. Singular Points of Curves. 


We now add a few remarks on the singular points of a curve. 
Here we shall content ourselves with giving a number of typical 
examples; for a more thorough investigation we refer the reader 
to the appendix to this chapter (p. 209). 

In the formule obtained above the expression f,2-+ F,? 
frequently occurs in the denominator. Accordingly we may 
expect something unusual to happen when this quantity vanishes, 
ie. when F, == 0 and F, = 0 at a point of the curve. This is 
especially brought out by the fact that at such a point the ex- 
pression y’ = —F',/F, for the slope of the tangent to the curve 
loses its meaning. 

We say that a point of a curve is a regular point if in the neigh- 
bourhood of this point either the co-ordinate y can be represented 
as a continuously differentiable function of x, or else x can be 
represented as a continuously differentiable function of y. In 
either case the curve has a tangent, and in the neighbourhood of 
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the point in question the curve differs but little from that 
tangent. All other points of a curve are called singular points 
(or singularities). 

From the theory of implicit functions we know that a point 
of the curve F(x, y) = 0 is regular if at that point F,, + 0, since 
we can then solve the equation so as to obtain a unique dif- 
ferentiable solution y = f(z). Similarly, the point is regular if 
F,+0. The singular points of the curve are accordingly to be 
sought for among those points of the curve at which the equations 


F,=0, F,=0 


are satisfied in addition to the equation of the curve. 

An important type of singularity is a multiple point, that is, 
a point through which two or more branches of the curve pass. 
For example, the origin is a multiple point of the lemniscate 


(x* + y?)? — 2a%(a? — 2) — 0. 


In the neighbourhood of such a point it is impossible to express 
the equation of the curve uniquely in the form y= f(z) or 
«= G(y). 

The truth of the rela- 
tions F,—= 0 and F,=0 
is a necessary, but by no 
means a sufficient, condi- 
tion for a multiple point; 
on the contrary, quite a 
different type of singularity 
may occur, such as a cusp. 

As an example we consider 
Fig. 5.—The surface y? — x? = G the curve 

y—at=0 


(ef. fig. 5), which has a cusp at the origin. At that point both the first 
partial derivatives of F vanish. 


Moreover, cases may occur in which F, and ¥F, both vanish, 
and yet there is no striking peculiarity of the curve at the point, 
the curve being regular there. 

This is exemplified by the curve 

y—t=0 
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or, in explicit form, 
y= at3, 


From the equations (— 2) = 2“, y’ = 42: we see at once that the curve 
is symmetrical with respect to the y-axis and touches the x-axis at the 
origin, like a parabola. Yet the origin is a somewhat special point on the 
curve, since the second derivative is infinite there. The curvature is there- 
fore infinite, while the direction of the tangent exhibits no peculiarity. 
Another example is the curve (y— z= 0, which is a straight line and 
therefore regular throughout, even though F, = 0 and F, = 0 for every 
point of the line. 


As a result of this discussion we see that in the investigation 
and discussion of singular points of a curve it is not enough to 
verify that the two equations F,== 0 and F, = 0 are satisfied; 
on the contrary, each case must be studied specially (cf. Appendix, 
section 2, p. 209). 


3. Implicit Representation of Surfaces. 


Hitherto we have usually represented a function z= f(z, y) 
(here we write z instead of the symbol wu employed above) by 
means of a surface in xyz-space. If, however, we are originally 
given not the function, but a surface in space, the preference 
which this form of expression gives to the co-ordinate z may prove 
inconvenient, just as in the case of the expression of plane curves 
in the form y= f(x). It is more natural and more general to 
represent surfaces in space by equations of the form F(z, y, z) = 0 
or F(z, y, 2)= const., e.g. to represent the sphere by the equation 
gt y®-+ 22 — #2 — 0, and not by z= + 4/(r? — 2? — y?). The 
form z — f(x, y) = 0 can then be treated as a special case. 

In order to establish the equation of the tangent plane to 
the surface F(x, y, z) = 0 at the point (a, y, z), we first make the 
assumption * that at that point F,?-+ F,?-+ #2 +0; ie. that 
at least one of the partial derivatives, say F,, is not zero. Then 
from the equation of the surface we can determine z = f(z, y) 
explicitly as a function of z and y. If in the equation of the 
tangent plane 

[-2= (€ — a)Z_ + (n — y)zy 
we substitute for the derivatives z, and 2, their values 
* The vanishing of this expression indicates the possibility that certain singu- 
uss. 


laritice may ocour; this, however, we shall not disc 
6 (#912) 
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t= —F,/F, and z,— —F,/F,, we obtain the equation of the 
tangent plane in the form 


(€—2)F,+ (n— y)F,+ (€—2)F, = 0, 


where €, y, £ are current co-ordinates. 

As in the case of the tangent to a plane curve, we can derive 
this equation directly from the implicit representation of the 
surface, by setting ourselves the problem of finding a plane 
through the point (x, y, z) of the surface with the property that 
the distance from the plane to the point (e+ h, y+k, 2+) 
of the surface vanishes as p = +/(h? + k? + I?) does, to a higher 
order than p. 

Elementary theorems of analytical geometry (cf. Chap. I, 
section 1, p. 9) show that the direction cosines of the normal to 
the surface, that is, of the normal to the tangent plane, are given 
by the expressions 


cosa = ee eae cos B = Py 
OEP E ERE EY OO VEE T EET PY 
F, 
OY WEEE FET FA 


In taking the positive square root in the denominator we 
have assigned a definite sense of direction to the normal 
(cf. p. 125). 

If two surfaces F(x, y, z) = 0 and Gia, y, z) = 0 intersect one 
another at a point, the angle w between the surfaces is defined as 
the angle between their tangent planes, or, what is the same 
thing, the angle between their normals. This is given by 


PG, + FG, + FG, 
VF 2+ #2 + F2)V(G2+ 67+ G2) 
In particular, the condition for perpendicularity (orthogonality) is 
F,G,+ F,G,+ F.G,= 0. 


cos w = 


Instead of the single surface F(x, y, z) = 0 we may consider 
the whole family of surfaces F(x, y, 2) = c, where c is a constant 
different for each surface of the family. Here we assume that 
through each point of space, or at least through every point of a 
certain region of space, there passes one and only one surface 
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of the family; or, as we say, that the family covers the region 
simply. The individual surfaces are then called the level surfaces 
of the function F(z, y, z). In Chap. II, section 7 (p. 88) we con- 
sidered the gradient of this function, that is, the vector with 
the components F,, F,, F,. We see that these components have 
the same ratios as the direction cosines of the normal; hence 
we conclude that the gradient at the point with the co-ordinates 
(x, y, 2) 1s perpendicular to the level surface passing through that 
point. (If we accept this fact as already proved in Chap. II, 
section 7 (p. 90), we at once have a new and simple method for 
deriving the equation of the tangent plane, just like that given 
above (p. 124) for the equation of the tangent line.) 


As an example we consider the sphere 
2+ y+ 2— P= 0, 
At the point (x, y, z) the tangent plane is 
(& — x)2x + (n — y)2y + (8 — z)2z2=0 


e+ nyt &— r= 0. 


The direction cosines of the normal are proportional to a, y, 2; that is, the 
normal coincides with the radius vector drawn from the origin to the point 
(x, y, 2). 

For the most general ellipsoid with the co-ordinate axes as principal 
axes, 


a yt 2 


the equation of the tangent plane is 


x y z =a 


EXAMPLES 


1. Find the tangent plane 
(a) of the surface 
x + Qay* — 7284 3y4+1=—0 
at the point (1, 1, 1); 
(b) of the surface 
(22 + oy)? + a? — y? + Toy + Ba + et — = 14 
at the point (1, 1, 1); 
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(c) of the surface 
sin? x + cos(y + 2) = 2 
at the point (- = 0) 
p 6’ 3’ e 
2. Calculate the curvature of the curve 
sina + cosy =1 
at the origin. =, 
(3*, Find the curvature at the origin of each of the two branches of the 
curve 
ylaz + by) = ox? + exty + foy® + gy?. 
4, Find the curvature of a curve which is given in polar co-ordinates 
by the equation f(r, 6) = 0. 
5. Prove that the three surfaces of the family of surfaces 
Yau J@+AtVytA=4 Je@tA—-Vyt a= 


z 
which pass through a single point are orthogonal to one another. 


6. The points A and B move uniformly with the same velocity, A 
starting from the origin and moving along the z-axis, B starting from 
the point (a, 0, 0) and moving parallel to the y-axis. Find the surface 
enveloped by the straight lines joining them. 


"1, Prove that the intersections of the curve 
(e+ y— a)? + Wary = 0 


with the line x ++ y = a are inflections of the curve. 
8. Discuss the singular points of the following curves: 
(a) F(x, y) = aa® + by® — cay = 0; 
(0) F(x, y) = (y? — 22°)? — oF = 0; 
(c) F(a, y) = (1+ ey — x = 0; 
(d) F(x, y) = y°(2a — x) — a8 = 0; 
(e) F(a, y) = (y — 2a — 2 = 0. 


9. Let (x, y) be a double point of the curve F(z, y) = 0. Calculate the 
angle » between the two tangents at (x, y), assuming that not all the 
second derivatives of F vanish at (2, y). 

Find the angle between the tangents at the double point (a) of the 
lemniscate, (b) of the folium of Descartes (of. p. 116). 


10. Determine a and 6 so that the conics 
493 +. day + y? — 107 — 10y + 11=—0 
(y + ba — 1 — 6)? — a(by —2+1—-—b)=0 
cut one another orthogonally at the point (1, 1) and have the same curva- 
ture at this point. 
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ll. If F(a, y, z)= 1 is the equation of a surface, F being a homo- 
geneous function of degree h, then the tangent plane at the point (x, 9-2) 
is given by 

EF, + nFy + OF, = he 

12. Let K’ and K” be two circles having two points A and B in com- 
mon. If a circle K is orthogonal to K’ and K”, then it is also orthogonal 
to every circle passing through A and B. 

13. Let z be defined as a function of x and y by the equation 


a + y8 + 23 — Sayz = 0. 


Express z, and z, as functions of 2, y, 2. 


3. Systems or Functions, TRANSFORMATIONS, AND MaPPines 


1. General Remarks. 


The results we have obtained for implicit functions now enable 
us to consider systems of functions, that is, to discuss several 
functions simultaneously. In this section we shall consider the 
particularly important case of systems where the number of 
functions is the same as the number of independent variables. 
We begin by investigating the meaning of such systems in the 
case of two independent variables. If the two functions 


£= g(x,y) and = #(z, y) 


are both differentiable in a region R of the xy-plane, we can inter- 
pret this system of functions in two different ways. The first 
interpretation (the second will be given in sub-section 2, p. 138) 
is by means of a mapping or transformation. To the point P with 
co-ordinates (x, y) in the zy-plane there corresponds the image 
point II with the co-ordinates (€, 7) in the én-plane. 

An example of such a mapping is the affine mapping or trans- 


formation Bice aot by 
n = ox + dy 


of Chapter I (p. 28), where a, }, c, d are constants. 

Frequently (x, y) and (é, y) are interpreted as points of one 
and the same plane. In this case we speak of a mapping of the 
xy-plane on itself, or a transformation of the xy-plane into uself.* 

* It is also possible to interpret a single function £ = f(x) of a single vari- 
able as a mapping, if we think of a point with co-ordinate 2 on an z-axis as 


being brought by means of the function into correspondence with a point é 
{Continued overleaf. 
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The fundamental problem connected with a mapping is that 
of its inversion; that is, the question whether and how z and y 
can in virtue of the equations = ¢(z, y) and 4 = (az, y) be 
regarded as functions of and 7, and how these inverse functions 
are to be differentiated. 

If when the point (x, y) ranges over the region R its image 
point (&, 7) ranges over a region B of the €y-plane, we call B 
the image region of R. If two different points of R always correspond 
to two different points of B, then for each point of B we can always 
find a single point of R& of which it is the image. Thus to each point 
of B we can assign the point of R of which it is the image. 
(This point of # is sometimes called the “ model”, as opposed 
to the “image ”’.) That is, we can invert the mapping uniquely, 
or determine x and y uniquely as functions 


w= g(f,n), y=hé, n) 


of € and », which are defined in B. We then say that the original 
mapping can be uniquely imverted, or has a unique inverse, or is 
a one-to-one * mapping, and we call «= g(f, 7), y= A(é, 7) 
the transformation iverse to the original transformation or 
mapping. 

If in this mapping the point P with co-ordinates (x, y) de- 
scribes a curve in the region R, its image point will likewise 
describe a curve in the region B, which is called the image curve 
of the first. For example, the curve x= c, which is parallel to 
the y-axis, corresponds to a curve in the £y-plane which is given 
in parametric form by the equations 


g= Pe, Y)s 1 Hos p(c, y), 


where y is the parameter. Again, to the curve y = k there corre- 
sponds the curve 


é = d(a, k), = (a, k). 


If to c and & we assign sequences of neighbouring values ¢,, cy, 
Cs,... and hy, kg, ky, ..., then the rectangular “ co-ordinate 


on a é-axis. By this point-to-point correspondence the whole or a part of the 
x-axis is mapped on the whole or a part of the ¢-axis. A uniform “ scale” of 
equidistant z-values on the z-axis will in general be expanded or contracted 
into a non-uniform scale of é-values on the £-axis. The é-scale may be regarded 
as a representation of the function é = f(z). Such a point of view is frequently 
found useful in applications (e.g. in nomography). 

* Often written (1, 1). 
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net” consisting of the lines «= const. and y= const. (e.g. 
the network of lines on ordinary graph paper) usually gives rise 
to a corresponding curvilinear net of curves in the &y-plane 


? 


& es 
Fig. 6 Fig. 7 


Nets of curves x = const. and y = const. in the wy-plane and the éy-plane 


(figs. 6, 7). The two families of curves composing this net of 
curves can be written in implicit form. If we represent the inverse 
mapping by the equations 


t= 9&7), y= ME, 9), 
the equations of the curves are simply 
9g n)=e and AE, n)=k 
respectively. 


In the same way, the two families of lines = y and y= « 
in the éy-plane correspond to the two families of curves 


Ht, Y= y, H(z, y)=« 
in the zy-plane. 


As an example we consider inversion, or the mapping by rectprocal 
radit or reflection in the unit circle. This transformation is given by the 
equations 

sO) a8 

apy ary 
To the point P with co-ordinates (z, y) there corresponds the point II 
with co-ordinates (2, 7) lying on the same line OP and satisfying the 


or OIT = ae so that the radius vector to P 


equation 2 + 743 = z + 7 


is the reciprocal of the radius vector to II. Points inside the unit circle 
are mapped on points outside the circle and vice versa. 


From the relation &? + 7? = - 1 


we find that the inverse tranafor- 
Ree a+ y? 
mation +8 
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‘aie 

which ts again inversion. 
For the region R we may take the whole zy-plane with the exception 
of the origin, and for the region B we may take the whole &y-plane with 
the exception of the origin. The lines § = ¢ and 7 = k in the &y-plane 


correspond to the circles a¢-+y?—12=0 and A+ y—Ty=0 in 
c 


ere he 
oS Ba 


the zy-plane respectively; at the origin these circles touch the y-axis 
and the x-axis respectively. In the same way, the rectilinear co-ordinate 
net in the zy-plane corresponds to the two families of circles touching the 
&-axis and the y-axis respectively at the origin. 


> ar 
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Fig. 8.—-Orthogonal families of rectangular hyperbolas 


As a further example we consider the mapping 
B= at — y®, 4 = Qay. 


The curves & = const. give rise in the xy-plane to the rectangular hyper- 
bolas 22 — y* = const., whose asymptotes are the lines = y and z = —y; 
the lines 7 = const. also correspond to a family of rectangular hyperbolas, 
having the co-ordinate axes as asymptotes. The hyperbolas of each family 
cut those of the other family at right angles (cf. fig. 8). The lines parallel 
to the axes in the zy-plane correspond to two families of parabolas in the 
En-plane, the parabolas 7? = 4c*(c? — &) corresponding to the lines x = ¢ 
and the parabolas 7? = 4c%(c?-+ &) corresponding to the lines y=. 
All these parabolas have the origin as focus and the §-axis as axis (a 
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family of confocal and coaxial parabolas; cf. fig. 9). For systems of 
confocal ellipses and hyperbolas cf. Ex. 5, p. 158. 


Fig. 9.— Orthogonal families of confocal parabolas 


One-to-one transformations have an important interpretation 
and application in the representation of deformations or motions 
of continuously-distributed substances, such as fluids. If we think 
of such a substance as spread out at a given time over a region 
R and then deformed by a motion, the substance originally 
spread over & will in general cover a region B different from 
R. Each particle of the substance can be distinguished at the 
beginning of the motion by its co-ordinates (a, y) in R, and at the 
end of the motion by its co-ordinates (&, 7) in B. The one-to-one 
character of the transformation obtained by bringing (x, y) into 
correspondence with (£, 7) is simply the mathematical expression 
of the physically obvious fact that the separate particles must 
remain recognizable after the ala ie. that separate particles 
remain separate. 


2. Introduction of New Curvilinear Co-ordinates. 


Closely connected with the first: interpretation (as a mapping) 
which we can give to a system of equations § = ¢(z, y), 7 = o(z, y) 
is the second interpretation, as a transformation of co-ordinates in 
the plane: If the functions ¢ and happen, not to be linear, this 


(8912) 
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is no longer an “ affine” transformation, but a transformation 
to general curvilinear co-ordinates. 

We again assume that when (z, y) ranges over a region R of 
the xy-plane the corresponding point (£, y) ranges over a region 
B of the én-plane, and also that for each point of B the corre- 
sponding (a, y) in R can be uniquely determined; in other words, 
that the transformation is one-to-one. The inverse transforma- 
tion we again denote by «= g(€, 7), y = A(é, 7). 

By the co-ordinates of a point P in a region R we can mean 
any number-pair which serves to specify the position of the point 
P in R uniquely. Rectangular co-ordinates are the simplest case 
of co-ordinates which extend over the whole plane. Another: 
typical case is the system of polar co-ordinates in the xy-plane, 
introduced by the equations 


f=r=f/(?+y), 
9 = 6= arc tan (y/z) (0S 6 < 2m). 


When we are given a system of functions = ¢(z, y), 
7 = 7(z, y) as above, we can in general assign to each point P 
(x, y) the corresponding values (£, 7) as new co-ordinates. For 
each pair of values (€, 7) belonging to the region B uniquely 
determines the pair (#, y), and thus uniquely determines the 
position of the point P in R; this entitles us to call ¢, n the co- 
ordinates of the point P. The “co-ordinate lines” € = const. 
and 7 = const. are then represented in the zy-plane by two 
families of curves, which are defined implicitly by the equations 
¢(z, y) = const. and (x, y) = const. respectively. These co- 
ordinate curves cover the region R with a co-ordinate net (usually 
curved), for which reason the co-ordinates (£, ) are also called 
curvilinear co-ordinates in R. 

We shall once again point out how closely these two inter- 
pretations of our system of equations are interrelated. The 
curves in the éy-plane which in the mapping correspond to 
straight lines parallel to the axes in the zy-plane can be directly 
regarded as the co-ordinate curves for the curvilinear co-ordinates 
a= 9(f, 4), y= A(E, n) in the &y-plane; conversely, the co- 
ordinate curves of the curvilinear co-ordinate system € = ¢(z, y), 
n = (a, y) in the zy-plane in the mapping are the images of the 
straight lines parallel to the axes in the €y-plane. Even in the 
interpretation of (£, 7) as curvilinear co-ordinates in the xy-plane 
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we must consider a £n-plane and a region B of that plane in which 
the point with the co-ordinates (£, n) can vary, if we wish to keep 
the situation clear. The difference is mainly in the point of view.* 
If we are chiefly interested in the region R of the zy-plane, we 
regard £, » simply as a new means of locating points in the 
region R, the region B of the &y-plane being then merely sub- 
sidiary; while if we are equally interested in the two regions 
R and B in the zy-plane and the én-plane respectively, it is 
preferable to regard the system of equations as specifying a cor- 
respondence between the two regions, that is, a mapping of one 
on the other. It is, however, always desirable to keep the two 
interpretations, mapping and transformation of co-ordinates, 
both in mind at the same time. 


If, for example, we introduce polar co-ordinates (r, 6) and interpret r 
and § as rectangular co-ordinates in an r0-plane, the circles r = const. 
and the lines 6 = const. are mapped on straight lines parallel to the axes 
in the r6-plane. If the region R of the zy-plane is the circle 27 + y? $1, 
the point (r, 6) of the 76-plane will range over a rectangle 0 Sr Sl, 
0 <6 < 2x, where corresponding points of the sides 6 = 0 and 6 = 2r 
are associated with one and the same point of R and the whole side r= 0 
is the image of the origin « = 0, y = 0. 

Another example of a curvilinear co-ordinate system is the system of 
parabolic co-ordinates. We arrive at these by considering the family of 
confocal parabolas in the zy-plane (cf. also p. 126 and fig. 9) 


2 ?) 
y 2p a+op 


all of which have the origin as focus and the z-axis as axis. 
Through each point of the plane there pass two parabolas of the family, 
one corresponding to a positive parameter value p = & and the other to 
a negative parameter value p = 7. We obtain these two values by solving 
for p the quadratic equation which results when in the equation 
y* = 2p(x+ p/2) we substitute the values of x and y corresponding to the 
point; this gives 


Bs mat (ety), n= —e—+/(e + x). 


These two quantities may be introduced as curvilinear co-ordinates in the 
zy-plane, the confocal parabolas then becoming the co-ordinate curves. 
These are indicated in fig. 9, if we imagine the symbols (2, y) and (&, 7) 
interchanged. 


* There is, however, a real difference, in that the equations always define 
& mapping, no matter how many points (z, y) correspond to one point (£, 7), 
while they define a transformation of co-ordinates only when the correspondence 
is one-to-one. 
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In introducing parabolic co-ordinates (&, ») we must bear in mind that 
the one pair of values (&, y) corresponds to the two points (x, y) and (x, —y) 
which are the two intersections of the corresponding parabolas. Hence 
in order to obtain a one-to-one correspondence between the pair (2, y) and 
the. pair (&, ») we must restrict ourselves to the half-plane y = 0, say. 
Then every region & in this half-plane is in a one-to-one correspondence 
with a region B of the &y-plane, and the rectangular co-ordinates (&, 7) 
of each point in this region B are exactly the same as the parabolic co- 
ordinates of the corresponding point in the region R. 


3. Extension to More than Two Independent Variables. 


In the case of three or more independent variables the state 
of affairs is analogous. Thus a system of three continuously- 
differentiable functions 


f= d(x, Y; 2), y= (x, y; 2), [= x(x, ¥y; 2), 


defined in a region R of xyz-space, may be regarded as the mapping 
of the region R on a region B of énf-space. If we assume that 
this mapping of R on B is one-to-one, so that for each image 
point (£, 7, 2) of B the co-ordinates (x, y, z) of the corresponding 
point (“‘ model ” point) in & can be uniquely calculated by means 
of functions 


+= its UD Q), y= A(é, Up ), z= Ké, up ¢), 


then (€, 7, £) may also be regarded as general co-ordinates of 
the point P in the region R. The surfaces € = const., 7 = const., 
¢ = const., or, in other symbols, 


$(a, ¥y, 2) = const., (a, ¥, 2) = const., x(z, ¥ z) = const. 


then form a system of three families of surfaces which cover 
the region R and may be called curvilinear co-ordinate sur- 
faces. 

Just as in the case of two independent variables, we can in- 
terpret one-to-one transformations in three dimensions as de- 
formations of a substance spread continuously throughout a 
region of space. 

A very important case of transformation of co-ordinates is 
given by polar co-ordinates in space. These specify the position 
of a point P in space by three numbers: (1) the distance 
r= 4/(2 + y?-+ 2%) from the origin, (2) the geographical longi- 
tude ¢, that is, the angle between the 2z-plane and the plane 
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determined by P and the z-axis, and (3) the polar distance 6, 
that is, the angle between 
the radius vector OP and 
the positive z-axis. As we 
see from fig. 10, the three 
polar co-ordinates r, d, 9 are 
related to the rectangular 
co-ordinates by the equations 
of transformation 


x= rcos¢ sin 8, 
y= r sing sin8, 
z= 7 c0s8, 


Fig. 10.—Three-dimensional polar co-ordinates 


from which we obtain the inverse relations 


= 7 (2+ ¥+ 2), 
¢ = are cos ——_"-___ = are sin ners Sree 
Ve + ¥) V (2? + ¥?) 
6 = arc cos ore sin Vet + 9) 


—$$—__—___—____. = arc Sooo; 
V (2? + y? + 2) Vet ¥+ 2) 
For polar co-ordinates in the plane the origin is an exceptional 


ose ls Wx 
XQ 


the angle is indeterminate 
there. In the same way, for 
polar co-ordinates in space the 
whole of the z-axis is an ex- 
ception, since the longitude ¢ 
is indeterminate there. At the 
origin itself the polar distance 
@ is also indeterminate. 

The co-ordinate surfaces 
for three-dimensional polar co- 
ordinates are as follows: (1) for  Fis- ‘det eee 
constant values of r, the con- 
centric spheres about the origin; (2) for constant values of ¢, 
the family of half-planes through the z-axis; (3) for constant 
values of 0, the circular cones with the z-axis as axis and the 
origin as vertex (fig. 11). 
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Another co-ordinate system which is often used is the system 
of cylindrical co-ordinates. These are obtained by introducing 
polar co-ordinates p, ¢ in the zy-plane and retaining z as the 
third co-ordinate. Then the formule of transformation from 
rectangular co-ordinates to cylindrical co-ordinates are 


= pcosd, 
y = psing, 
2=2 


and the inverse transformation is 


p= VV (a? + ¥*), 

$ = arc cos a See ant Se 
Ve? + ¥’) Ve + y) 

Z2=2 . 


The co-ordinate surfaces p= const. are the vertical circular 
cylinders which intersect the zy-plane in concentric circles with 
the origin as centre; the surfaces ¢ = const. are the half-planes 
through the z-axis, and the surfaces z= const. are the planes 
parallel to the zy-plane. 


4. Differentiation Formule for the Inverse Functions. 


In many cases of practical importance it is possible to solve 
the given system of equations directly, as in the above examples, 
and thus to recognize that the inverse functions are continuous 
and possess continuous derivatives. For the time being, there- 
fore, let us assume the existence and differentiability of the 
inverse functions. Then without actually solving the equations 
explicitly we can calculate the derivatives of the inverse functions 
in the followmg way. We substitute the inverse functions 
z= g(t, 7), y=(é, ) in the given equations = ¢(z, y), 
n= (x, y). On the right we obtain the compound functions 
O(9(E, ), RE, n)) and Y(g(E, 7), ACE, y)) of € and y; but these 
must be equal to £ and 7 respectively. We now differentiate 
each of the equations 


é= H(9(E, 7); h(é, n))» 
n= W(9(E, 7), AE, n)) 


with respect to & and to y, regarding € and y as independent 
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variables.* If on the right we apply the chain rule for the dif- 
ferentiation of compound functions, we obtain the system of 
equations 

l= $293 + dy hy, O= $29n + dyhis 

0= taGe + Py he, l= a9 + pyh,. 


Solving these equations, we obtain 


or 


ie. the partial derivatives of the inverse functions z= 9(£, 7) 
and y = h(€, 7) with respect to € and », expressed in terms of 
the derivatives of the original functions ¢(z, y) and (x, y) with 
tespect to 2 and y. For brevity we have here written 


D= £nNy va EyNe = 


This expression D, which we assume is not zero at the point in 
question, is called the Jacobian or functional determinant of the 
functions = ¢(2, y) and n = (x, y) with respect to the variables 
xand y. 

In the above, as occasionally elsewhere, we have used the 
shorter notation é(x, y) instead of the more detailed notation 
€= ¢(z, y), which distinguishes between the quantity ¢ and 
its functional expression ¢(7, y). We shall often use similar 
abbreviations in the future when there is no risk of confusion. 


For polar co-ordinates in the plane expressed in terms of rectangular 
co-ordinates, 


E=r=+/(2*+ y*) and n= 0 = are tan’, 


* These equations hold for all values of £ and » under consideration; as 
we say, they hold identically, in contrast to equations between variables which 
are satisfied only for some of the values of these variables. Such identical 
equations or identities, when differentiated with respect to any of the variables 
ace in them, again yield identities, as follows immediately from the 

efinition. 
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for example, the partial derivatives are 


ice, 2 
Viet y) or 4 Ve+y) or 
Oj a Ga 
at + y@ wo By 
Hence the Jacobian has the value 
aan ey y 


and the partial derivatives of the inverse functions (rectangular co- 
ordinates expressed in terms of polar co-ordinates) are 


ae = = 7 — 
os ome %=-Y, n= Yo = 2, 


as we could have found more easily by direct Guterraeation of the inverse 
formule z = r cos@, y = r sin 6. 


The Jacobian occurs so frequently that a special symbol is 
often used for it: 
— E10). 
Of, y)” 


The appropriateness of this abbreviation will soon be obvious. 
From the formule 


for the derivatives of the inverse functions we find that the 
Jacobian of the functions z= 2(é, n) and y= y¥(g, y) with 
respect to € and 7 1s given by the expression 


A(x, y) — = £aNy = EyNe a ag, 7) 
GE 9) 7 On tem SE Da 


That is, the Jacobian of the inverse system of functions is the recip- 
rocal. of the Jacobian of the original system. 

In the same way we can also express the second derivatives 
of the inverse functions in terms of the first and second derivatives 
of the given functions. We have only to differentiate the linear 
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equations given above with respect to ¢ and to » by means of 
the chain rule. (We assume, of course, that the given function 
possesses continuous derivatives of the second order.) We then 
obtain linear equations from which the required derivatives can 
readily be calculated. 

For example, to calculate the derivatives’ 


we differentiate the two equations 
L= §,a+ Eu¥e 
O= Heb + We 
once again with respect to ¢ and by the chain rule obtain 
O= Laat? + Leute ye + Evvye® + Sates + EvVers 
O = Hente + Woy teY_ + Nw Ye? + DeLee + WY ee 


If we solve this system of linear equations, regarding the quantities 
a, and y,, as unknowns (the determinant of the system is again D, 
and therefore, by hypothesis, not zero) and then replace «; and ¥; 
by the values already known for them, a brief calculation gives 


lp = — i anNy? — 2ey Ney + Ew Ne éy 
BD Nowy" — Wey Ney + Now Nae Ny 
and 
Y= i Exaty? — 2b ey ty + Ewe bn , 
DF ne2ty® — 2Ney Ney + Ny Ne? Ne 


The third and higher derivatives can be obtained in the same 
way, by repeated differentiation of the linear system of equations; 
at each stage we obtain a system of linear equations with the 
(non-vanishing) determinant D. 


5. Resolution and Combination of Mappings and Transformations. 


In Chapter I we saw that every affine transformation can be 
analysed into simple or, as we say, primitive transformations, the 
first of which deforms the plane in one direction only and the 
second deforms the already deformed plane again in another 
direction. In each of these transformations there is really only 
one new variable introduced. 
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We can now do exactly the same thing for transformations 
in general. 

We begin with some remarks on the combination of trans- 
formations. If the transformation 


é= dz, y), n= H(z, y) 


_ gives a one-to-one mapping of the point (z, y), which ranges over 
a region R, on the point (, 7) of the region B in the éy-plane, 
and if the equations 


a= DE, 9); v= Pg, 7) 


give a one-to-one mapping of the region B on a region R’ in the 
uv-plane, then a one-to-one mapping of R on R’ simultaneously 
occurs. This mapping we naturally call the resultant mapping or 
resultant transformation, and say that it is obtained by combining 
the two given mappings. The resultant transformation is given 
by the equations 


u= O(¢(z, y), Ha, ¥))s C= ¥(¢(e, y), (a, y)); 


from the definition it follows at once that this mapping is one-to- 
one. 
By the rules for differentiating compound functions we obtain 


Ou 
Ox cf vee 


a 
iy = ®,¢, + Db» 


ov 
on ae Vide + ©, Was 


ov 

ay —= Vidy + Phy 
On comparing this with the law for the multiplication of deter- 
minants (cf. p. 36) we find * that the Jacobian of u and v with 
respect to x and y is 


Oudv dudv 


on dy ay Oe = (OY, om 0,1) (Patpy a brie)» 


* The same result can of course be obtained by straightforward muliipli. 
cation. 
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In words: 


The Jacobian of the resultant transformation is equal to the 
product of the Jacobians of the individual transformations. 


In symbols: 
Ou, v) __ Ou, ¥) ag, m). 
O(z,y) 9(€, n) O(a, y) 


This equation brings out the appropriateness of our symbol for 
the Jacobian. When transformations are combined, the Jacobians 
behave in the same way as the derivatives behave when functions of 
one variable are combined. The Jacobian of the resultant transfor- 
mation differs from zero, provided the same is true for the in- 
dividual (or component) transformations. 

If, in particular, the second transformation 


u= Of, 7), v= ¥(E, 7) 
is the inverse of the first, 
g= f(z, y): = 2(x, y)s 


and if both transformations are differentiable, the resultant 
transformation will simply be the identical transformation, that 
is, w= 2, v=y. The Jacobian of this last transformation is 
obviously 1, so that we again obtain the relation of p. 144, 


(En) Ae 9) _ 4 
ale, y) HE, 2) 


From this, incidentally, it follows that neither of the two Jacobians 
can vanish. 

Before we take up the question of the resolution of an 
arbitrary transformation into primitive transformations, we 
shall consider the following primitive transformation: 


é= 4(z, y) n=y. 


We assume that the Jacobian D= ¢, of this transformation 
differs from zero throughout the region R, i.e. we assume that 
¢, > 0, say, in the region. The transformation deforms the 
region R into a region B; and we may imagine that the effect 
of the transformation is to move each point in the direction of 
the z-axis, since the ordinate is unchanged. After deformation 
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the point (x, y) has a new abscissa which depends on both z 
and y. The condition ¢,>0 means that when y is fixed 
€ varies monotonically with z. This ensures the one-to-one 
correspondence of the points on a line y= const. before and 


Fig. 12.—Transformation in which the’ sense of rotation is preserved 


after the transformation; in fact, two points P(z,, y) and Q(z», y) 
with the same ordinate y and 7, > 2, are transformed into two 
points P’ and Q’ which again have the same ordinate and whose 
abscisse satisfy the inequality £ > &, (cf. fig. 12). This fact also 


Fig. 13.—Transformation in which the sense of rotation is reversed 


shows that after the transformation the sense of rotation is the 
same as that in the xy-plane. 

‘If $, were negative, the two points P and @ would corre- 
spond to points with the same ordinate and with abscisse &, 
and é,, but this time we should have é, > &, (cf. fig. 13). The sense 
of rotation would therefore be reversed, as we have already seen 
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in Chapter I (p. 35) for the simple case of affine ¢ransforma- 
tions. of 
If the primitive transformation 


= ¢(%,y) n=y 


is continuously differentiable, and its Jacobian 4, differs from 
zero at a point P(x», Yo), then in a neighbourhood of P the trans- 
formation has a unique inverse, and this inverse is also a primitive 
transformation of the same type. In virtue of the hypothesis 
dx = 0 we can apply the theorem on implicit functions given in 
section 1, No. 3 (p. 114), and thus find that in a neighbourhood 
of (a; Yo) the equation ¢ = ¢(z, y) determines the quantity 
uniquely as a continuously differentiable function + = g(€, y) of 
éand y.* The two formule 


z= én), y= 


therefore give us the inverse transformation, whose determinant 
is = 1/$e + 9. : 
.. Tf we now think of the region B in the éy-plane as itself 
mapped on a region & in the wv-plane by means of a primitive 
transformation 

a= é, .= FE, 7), 


where we assume that ¥’, is positive, the state of affairs is just as 
above, except that the deformation takes place in the direction 
of the other co-ordinate. This transformation likewise preserves 
the sense of rotation (or reverses it if the relation ‘’, < 0 holds 
- instead of ‘Y, > 0). ee: 7 

By combining the two primitive transformations we obtain 
the transformation st! 


u= d(x, y), 
v= Yd, y); y) pol p(2, Y)s 


and from the theorem on Jacobians we see that 


* Here we use the fact that a function with two continuous derivatives is 
differentiable. — a na 
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We now assert that an arbitrary one-to-one continuously 
differentiable transformation 


“= d(z, y), v= p(x, y) 


of the region R in the zy-plane on a region R’ in the uv-plane 
can be resolved in the neighbourhood of any point interior to R 
into continuously differentiable primitive transformations, pro- 
vided that throughout the whole region R the Jacobian 


O(u, v) _ i 
a L, y) Paty dubs 


differs from zero. 

From the non-vanishing of the Jacobian it follows that at 
no point can we have both ¢, = 0 and ¢,= 0. We consider a 
point with co-ordinates (x, yp) and assume that at that point 
¢2 +0. Then by the main theorem of section 1, No. 5 (p. 117) 
we can mark off intervals 7, S¢ Sa, y Sy S yoy Sus 
about 2, Yo, and uo = U(%p, Yo) respectively, in such a way that 
within these bounds the equation u= ¢(x, y) can be solved 
uniquely for 7 and defines x = g(u, y) as a continuously differen- 
tiable function of u and y. If we substitute this expression in 
v= (a, y), we obtain v= Y(g(u, y), y) = Vu, y). Hence in 
any neighbourhood of the point (x9, Y¥p) we may regard the given 
transformation as composed of the two primitive transformations 


é= $(2, y), n=y 
u=€ v= P(E, »). 


Similarly, in a neighbourhood of a point (9, yo) at which 
¢y + 0 we can resolve the given transformation into two pmmi- 
tive transformations of the form 


f= 42, n= $(2, y) 
u=y, v=Vé,9) (= pfx, y(u, a) }). 


This pair of transformations is not exactly identical in form with 
the pairs considered above, each of which leaves one of the co- 
ordinate directions unaltered. It can easily be brought into 
that form, however, by interchanging the letters u and v (this 
interchange is itself the resultant of three very simple primitive 
transformations (cf. the footnote on p. 31)). For the purposes of 


and 
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the present chapter, however, it is more convenient not to carry 
out this resolution; instead, we write the last set of equations in 
the form 

f=@, = $(x, y), 

= —PE, n), V=% 

u= 3, v= —4U. 


These last represent two primitive transformations, each affecting 
one co-ordinate direction only, and also a rotation of the axes 
in the we-plane through an angle of 90°. The rotation is so easy 
to deal with that it need not be split up into primitive trans- 
formations. 

It is not to be expected that we can resolve a transformation 
into primitive transformations in one and the same way through- 
out the whole region. Since, however, one of the two types of 
resolution can be carried out for every interior point of R, every 
closed region interior to R can be subdivided into a finite number 
of sub-regions* in such a way that in each sub-region one of the 
resolutions is possible. 

From the possibility of this resolution into primitive trans- 
formations we can draw an interesting conclusion. We have seen 
that in the case of a primitive transformation the sense of rotation 
is reversed or preserved according as the Jacobian is negative or 
positive. From this it follows that in the case of general trans- 
formations the sense of rotation is reversed or preserved according 
as the sign of the Jacobian is negative or positive. Yor if the sign 
of the Jacobian is positive, when the resolution into primitive 
transformations is carried out the Jacobians of the primitive 
transformations will either be both positive or both negative. (The 
rotation of the u- and v-axes through 90°, required in some cases, 
has -+1 for its Jacobian and leaves the sense of rotation un- 
changed, and accordingly does not affect the discussion at all.) In 
the first case it is obvious that the sense of rotation is preserved; 
in the second case this follows from the fact that two reversals 
of the sense bring us back to the original sense. If the Jacobian 
is negative, however, one, and only one, of the primitive trans- 
formations will have a negative Jacobian and will therefore 
reverse the sense, while the other will not affect it. 


* This follows from the covering theorem (cf. p. 99). 
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6. General Theorem on the Inversion of Transformations and 
Systems of Implicit Functions. 


The possibility of inverting a transformation depends on the 
following general theorem: 

If in the neighbourhood of a point (Xo, Yo) the functions (x, y) 
and (x, y) are continuously differentiable,* and ug = (Xo, Yo); 
Vo = ¥(Xq, Yo), and if im addition the Jacobian D = ¢,x), — dyip, 
is not zero at (Xp, Yo), then in a neighbourhood of the point 
(Xp, Yo) the system of equations u= ¢(x, y), v= ¥(x, y) has a 
unique inverse; that is, there is a uniquely determined pair of 
functions 7 = g(u, v), y = h(u, v) such that 2, = g(uo, v9) and 
Yo = A(uo, Vg) and also the equations 


“= P(g(u, v), hu, v)) and v= (g(u, v), hu, »)) 


hold in some neighbourhood of the point (to, v9). 

In the neighbourhood of (ug, Vo) the so-called inverse functions 
x= g(u, v), y = h(u, v) possess continuous derivatives which are 
given by the expressions 


eg 2. Hi 
ou Dy’ Doy 
dy __ __ 1 a oy sd 


du Dae” Ce" 

The proof follows from the discussions in No. 5 (p. 149). For 
in a sufficiently small neighbourhood of the point (a, yo) we can 
resolve the transformation u= (x, y), v= (x, y) into continu- 
ously differentiable primitive transformations, possibly with a rota- 
tion of the u- and v-axes through 90° in addition. Each of these 
has a unique inverse, which is itself a continuously differentiable 
transformation. The combination of these inverse transformations 
at once gives us the transformation which is the inverse of the given 
one. This, being a combination of continuously differentiable trans- 
formations, is itself continuously differentiable. It then follows 
from No. 4 (p. 143) that the differentiation formule hold as stated. 

This inversion theorem is a special case of a more general 
theorem which may be regarded as an extension of the theorem 
of implicit functions to systems of functions. The theorem of 


* Le. are continuous and possess continuous derivatives. 
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implicit functions (section 1, p. 117) applies to the solution of one 
equation for one of the variables. The general theorem is as 
follows: 

If $(x, y, u, V,..., w) and ox, y, u, Vv, ..., W) are con- 
tinuously differentiable functions of x, y, U, V, +++» W, and the 
equations 


A(a, Y, U,V,...,0)=0 and (x,y, u,%,...,w)=90 


are satisfied by a certain set of values Xp, Yo: Uo, Vor -- + » Wo, and 
if in addition the Jacobian of 4 and x with respect to x and y differs 
from zero at that point (that is, D = ¢,, — dy, = 0), then in 
the neighbourhood of that point the equations 6= 0 and p= 0 
can be solved in one, and only one, way for x and y, and this solution 
gives x and y as continuously differentiable functions of u,v, .. . , W- 

The proof of this theorem is similar to that of the inversion 
theorem above. From the assumption that D + 0 we can conclude 
without loss of generality that at the point in question ¢, + 0. 
Then by the main theorem of section 1 (p. 117), if we restrict 
L,Y, U, v,..., w to sufficiently small intervals about %o, Yo, Uo, 
Ug) . . - , Wy respectively, the equation ¢(z, y, u, v,..., w) can be 
solved in exactly one way for x as a function of the other variables, 
and this solution z= g(y, u, v, ..., w) is a continuously differ- 
entiable function of its arguments, and has the partial derivative 
Iv = —¢y/ba- If we substitute this function x = g{y, u, ¥,..., ¥) 
in p(x, y, U, v,..., w), We obtain a function P(x, y, U, 0, ..., W) 
= Py, u, v,..., wv), and 

y= pF + fy = Lae 
e de 

Hence in virtue of the assumption that D =: 0 we see that the 
derivative WY, is not zero. Thus if we restrict y, u, v,..., w to 
intervals about yo, %o, Up, - - - » Wo (which we take to be smaller 
than the intervals to which they were previously restricted), we 
can solve the equation Y= 0 in exactly one way for y as a 
function of u, v, ..., w, and this solution is continuously dif- 
ferentiable. Substituting this expression for y in the equation 
x= gly, U, v,..., w) now gives @ as a function of u, v,..., ¥, 
and this solution is continuously differentiable and unique, 
subject to the restriction of x, y, u, v,..., w to sufficiently small 
intervals about 2, Yo, Ug, Vy - +» » Wo respectively. 
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7. Non-independent Functions. 


It is worth mentioning that if the Jacobian D vanishes at a 
point (x, Yo), no general statement can be made about the 
possibility of solving the equations in the neighbourhood of 
that point. Even if the inverse functions do happen to exist, 
however, they cannot be differentiable, for then the product 


O$, 1) (x, y) would vanish, while by p. 147 it must be equal 
O(a, y) O(€, 9) 
to 1. 


For example, the equations 


ua, oxy 
can be solved uniquely, the solutions being 
t= “/u, y= ¥, 


although the Jacobian vanishes at the origin; but the function VA « is not 
differentiable at the origin. 
On the other hand, the equations 
u= e— yy y= Dry 


cannot be solved uniquely in the neighbourhood of the origin, since the 
two points (x, y) and (—2, —y) of the zy-plane both correspond to the 
same point of the uv-plane. 


If, however, the Jacobian vanishes tdentically, that is, not 
merely at the single point (x, y), but at every point in a whole 
neighbourhood of the point (x, y), then the transformation is of 
the type called degenerate. In this case we say that the functions 
u= g(x, y) and v= g(x, y) are dependent. We first consider 
the special, almost trivial, case in which the equations ¢, = 0 
and ¢,= 0 hold everywhere, so that the function ¢(z, y) is a 
constant. 

We then see that while the point (2, y) ranges over a whole 
region its image (uv, v) always remains on the line «== const. 
That is, our region is mapped only on a line, instead of on a 
region, so that there is no possibility here of speaking of a one- 
to-one mapping of two two-dimensional regions on one another. 
A similar situation arises in the general case in which at least 
one of the derivatives ¢, or ¢, does not vanish, but the Jacobian 
D is still zero. We suppose that at a point (2, Yo) of the region 
under consideration we have ¢,+0. It is then possible to 
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resolve our transformation into two primitive transformations 
é= $l, y), n=y and u=é, v= (é, 7) just as in No. 5 
(p. 150), for there we made use only of the assumption ¢, + 0. 
In virtue of the equation D= ¢4,,= 0, however, 4, must 
be identically zero in the region where ¢,+-0; that is, the 
quantity b= v does not depend on 7 at all, and v is a function 
of €=ualone. Our result is therefore as follows: 

If the Jacobian of the transformation vanishes identically, a 
region of the xy-plane is mapped by the transformation on a curve 
in the uv-plane instead of on a region, since in a certain interval 
of values of w only one value of v corresponds to each value 
of u. Thus tf the Jacobian vanishes identically the functions are 
not independent, 1.e. a relation 


F(g, $) = 0 


evists which is satisfied for all systems of values (x, y) in the above- 
mentioned region. For if F(u, v) = 0 is the equation of the curve 
in the wv-plane on which the region of the zy-plane is mapped, 
then for all points of this region the equation 


F(¢(z, y), ¥(z, y)) = 9 


is satisfied, i.e. this equation is an identity in x and y. 

The exceptional case discussed separately at the beginning 
is obviously included in this general statement. The curve in 
question is then just the curve u = const., which is a parallel to 
the v-axis. 

An example of a degenerate transformation is 

E=aty n=(et+y) 


According to this transformation all the points of the zy-plane are mapped 
on the points of the parabola 7 = & in the &y-plane. An inversion of the 
transformation is out of the question, for all the points of the line x + y 
= const, are mapped on a single point (&, 7). As we can easily verify, the 
value of the Jacobian is zero. The relation between the functions & and », 
in accordance with the general theorem, is given by the equation 
FE, n) = B— y= 0. 


8. Concluding Remarks. 


The generalization of the theory for three or more independent 
variables offers no particular difficulties. The chief difference 
is that instead of the two-rowed determinant D we have deter- 
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minants with three or more rows. In the case of transformations 
with three independent variables, 


€ = d(2, y 2), n= o(z, y; 2), [= x(x, y 2), 
c= 9(€, Ns Q), y= h(é, Up é), 2= Ké, y; Q), 


the Jacobian is given by the equation 


pate o_|% Ye % 
a(z, y 2) — | Py by Xu}e 
ds we, Xz 


In the same way, for transformations 


£¢= $2, Le, .~ , Lp) i=] 2 
m= gly bp-s-sf) 9 


with n independent variables the Jacobian is 


-)% 


aé,, £5, eaey én) —_ Od, Ody on Odbn ; 


O(a4, Lg, +++» Ly) Oa,’ Ory’ “"°’ Oly 
Od, od Opn 
02, Oy. ” Oty 


For more than two independent variables it is still true that 
when transformations are combined the Jacobians are multiplied 
together. In symbols, 


ag, ba, sees £n) . Am, Nay s++9 Nn) a Hg, £2, ceey En) 
On, Nar ++ +> Nn) A(x, Yq, +++, Xn) O(a, Loy -++5 Ln) 


In particular, the Jacobian of the inverse transformation is the 
reciprocal of the Jacobian of the original transformation. 

The theorems on the resolution and combination of trans- 
formations, on the inversion of a transformation, and on the 
dependence of transformations remain valid for three and more 
independent variables. The proofs are similar to those for the 
case n= 2; to avoid unnecessary repetition we shall omit them 
here. 

In the preceding section we have seen that the behaviour of 
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a general transformation in many ways resembles that of an 
affine transformation, and that the Jacobian plays the same part 
as the determinant does in the case of affine transformations. 
The following remark makes this even clearer. Since the functions 
é= (gz, y) and » = (2, y) are differentiable in the neighbour- 
hood of (a9, Yo), we can express them in the form 


E— Ey = (& — %q) palZq, Yo) + (Y — Yo) PulXo; Yo) 
+ Ve aF+U— vor 


1 — No = (& — Lo) PalZo Yo) + (Y — Yo) Pu(Xo» Yo) 
+ 8V(e— a)? + (y — yo)’, 
where « and & tend to zero with +/{(a— %)* + (y — Yo)*}. 
This shows that for sufficiently small values of | *—2,| and 
| ¥ — ¥o| the transformation may be regarded, to a first approxi- 
mation, as affine, since it can be represented approximately by 
the affine transformation 


E= 4+ (& — 2y)balZq, Yo) + (Ly — Yo)Pu(Zo> Yo)s 
1 = No + (& — Lo)PalXo, Yo) + (Y — Yo)bu(Tor Yo)s 


whose determinant is the Jacobian of the original transformation. 


EXAMPLES 
1. If f(z)isa continuously differentiable function, then the transformation 
u= f(x), v= —y + xf(z) 


has a single inverse in every region of the zy-plane in which f’(z) + 0. 
The inverse transformation has the form 


z= g(u), y= —o + ug(u). 


2. A transformation is said to be “conformal” (see p. 166) if the 
angle between any two curves is preserved. 
(a) Prove that the inversion 
Se ie 
S= ate ate 
is a conformal transformation. 
(b) Prove that the inverse of any circle is another circle or a straight 
line. 
(c) Find the Jacobian of the inversion. 
3. Prove that in a curvilinear triangle which is formed by three circles 
passing through one point O, the sum of the angles is x. 
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4, A transformation of the plane 
u= o(z, y), v= (2, y) 
is conformal if the functions ¢ and ) satisfy the identities 


Pa = Vy Py = — Var 
5. The equation 


aa ae (a > b) 


determines two values of t, depending on x and y: 


4= A(x, Y)» 
t, = u(x, y). 


(a) Prove that the curves ¢, = const. and f, = const. are ellipses and 
hyperbolas all having the same foci (confocal conics). 

(b) Prove that the curves ¢, = const. and ¢, = const. are orthogonal. 

(c) , and ¢, may be used as curvilinear co-ordinates (so-called “focal” 
co-ordinates). Express 2 and y in terms of these co-ordinates. 

a > . 

(d) Express the Jacobian ar 5) in terms of x and y. 

(e) Find the condition that two curves, which are represented para- 
metrically in the system of focal co-ordinates by the equations 


4 = f,(0), & = f(a) and t= 9i(u), t= gol) 


are orthogonal to one another. 
6. (a) Prove that the equation in ¢ 


co ¥ a 
Ao ba ee te ® 


has three distinct real roots ¢,, t, ts, which lie respectively in the intervals 
—-wct<uc<ct<bbet<ca, 


provided that the point (2, y, z) does not lie on a co-ordinate plane. 

(b) Prove that the three surfaces t+, = const., ¢, = const., t, = const. 
passing through an arbitrary point are orthogonal to one another. 

(c) Express 2, y, z in terms of the “ focal co-ordinates ” t,, ta, ts. 


7. Prove that the transformation of the zy-plane given by the equations 


x y 
a t(s+ 59 ay = a(v- 54 a) 
(a) is conformal; 
(b) transforms straight lines through the origin and circles with the 
origin as centre in the zy-plane into confocal conics ¢ = const. given by 
B2 a 


if4ci=go 
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8. Inversion in three dimensions is defined by the formulx 
bea hen Us eae 
“Bp pe ese apy ee OO Pye 
Prove that 


(a) the angle between any two surfaces is unchanged; 
(6) spheres are transformed either into spheres or into planes. 


9. Prove that if all the normals of a surface z= u(x, y) meet the 
z-axis, then the surface is a surface of revolution. 


4, APPLICATIONS 


1. Applications to the Theory of Surfaces. 


In the study of surfaces, as in that of curves, parametric 
representation is frequently to be preferred to other types of 
representation. Here we need two parameters instead of one; 
we denote them by u and v. A parametric representation may be 
expressed in the form 


c= Gu, v), y= $(u, o), 2= x(u, 2), 


where ¢, y, and x are given functions of the parameters u and v 
and the point (wu, v) ranges over a given region R in the w-plane. 
The corresponding point with the three rectangular co-ordinates 
(x, y, 2) then ranges over a configuration in zyz-space. In general 
this configuration is a surface, which can be represented in the 
form z= f(x, y), say. For we can seek to solve two of our three 
equations for « and in terms of the two corresponding rect- 
angular co-ordinates. If we substitute the expressions thus found 
for u and v in the third equation, we obtain an unsymmetrical 
representation of the surface, z= f(x, y), say.* Hence in order 
to ensure that the equations really do represent a surface, we 
have only to assume that the three Jacobians 


Pu by Pu ty Xu Xo 
thy po Xu Xe : Pu dy 


do not all vanish at once; in a single formula, that 
(Puthy — doth)? + (}uXe = toXu)? + (xubo = XoPu)® > 0. 


Then in some neighbourhood of each point in space represented 


2 


* This is actually a special case of the parametric form, as we see by putting 
z=<uandy =». 
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by our three equations it is certainly possible to express one of 
the three co-ordinates uniquely in terms of the other two. 


A simple example of parametric representation is the representation of 
the spherical surface a? + y? + 2% = r° of radius r by the equations 


z=rcosusinvy, y=rsinusiny, z= 7 cose 
(0Su<2n, OSvSn), 


where v = 6 is the polar distance and u = ¢ is the geographical longitude 
of the point on the sphere (cf. p. 141). 

This example exhibits one of the advantages of parametric representa- 
tion. The three co-ordinates are given explicitly as functions of u and v, 
and these functions are single-valued. If v runs from 7/2 to m we obtain 
the lower hemisphere, i.e. z= — V (r* — 2% — y*), while values of v from 
0 to x/2 give the upper hemisphere. Thus with the parametric representa- 
tion it is not necessary, as it is with the representation z= + V (7? — 2% — y*), 
to consider two “ single-valued branches” of the function in order to 
obtain the whole sphere. 

We obtain another parametric representation of the sphere by means of 
stereographic projection. In order to project the sphere 2? -+ 42+ 2—r=0 


Fig. 14.——Stereographic projection of the sphere 


stereographically from the “north pole” (0, 0, r) on the “ equatorial 
plane” z = 0, we join each point of the surface to the north pole N by 
a straight line and call the intersection of this line with the equatorial 
plane the stereographic image of the corresponding point of the sphere 
(fig. 14). We thus obtain a one-to-one correspondence between the points 
of the sphere and the points of the plane, except for the north pole N. 
Using elementary geometry, we readily find that this correspondence is 
expressed by the formule 


en _ ___Srty PE cla alee 
ate te I~ Bete wre’ 

where (u, v) are the rectangular co-ordinates of the image-point in the plane. 

These equations may be regarded as a parametric representation of the 
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sphere, the parameters u and v being rectangular co-ordinates in the 
uv-plane. 


As a further example we give parametric representations of the surfaces 
a yf at 
a’ a 
which are called the hyperboloid of one sheet and the hyperboloid of two 


Fig. 15.—Hyperboloid of one sheet Fig. 16.—Hyperboloid of two sheets 


sheets respectively (cf. figs. 15 and 16). The hyperboloid of one sheet is 
represented by 


e+e? 
a= acosu 5) = @ cos4u coshy, 
. ete? ? Osu<2r 
y = bsinu = bsinw cosh», 
—-e*<v<+o 
e— er ‘ 
z=c = csinhy; 


the hyperboloid of two sheets by 


ev? +e? 
2=a 3 = acosh»y, 
ev — ee? - Osu<2r 
y = bcosu = b cosu sinh», -_ 
2 —w<v<+o 
e? — e~? 
z2=>csinu 2 = csinu sinh». 


In general, we may regard the parametric representation of 
a surface as the mapping of the region R of the uv-plane on the 
7 (8912) 


162 DEVELOPMENTS AND APPLICATIONS  [Cuapr. 


corresponding surface, where, as always, the word mapping is 
understood to mean a point-to-point correspondence. To each 
point of the region R of the wo-plane there corresponds one point 
of the surface, and in general the converse is also true.* 

In the same way, a curve u= u(t), v = v(t) in the uv-plane 
corresponds in virtue of the equations z = ¢(u(t), v(t)) = x(t), ... 
to a curve on the surface (cf. p. 85). In particular, in the 
representation of the sphere by means of polar co-ordinates 
the meridians are represented by the equation w= const. and 
the parallels of latitude by v== const. This net of curves 
thus corresponds to the system of parallels to the axes in 
the wv-plane. 

The representation of a curve on a given surface is one of the 
most important methods for thorough investigation of the proper- 
ties of the surface. Here we shall give only the expression for s, 
the length of arc of such a curve. As we mentioned in Chap. II, 
section 7 (p. 86), we have 


(a) = Ga) + G+ @) 


so that in virtue of the equations 
da 


dt 
we obtain 


Gy i, wy oreo + a(S . 


where for the sake of compactness we have introduced the 
Gaussian fundamental quantities of the surface, 


oy dz \? 
ae =) Bf (2) + z) : 


_ Ox Ou , Oy Oy , 02 Oz 
a du ov 


ox \? ae 2 
dv 
* This, of course, is not always the case. For example, in the representa- 


tion of the sphere by polar co-ordinates (p. 160) the poles of the sphere corre- 
spond to the whole line-segments v = 0 and v = a respectively. 


= 2, wo, &ec., 
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These are independent of the particular choice of the curve on 
the surface, and depend only on the surface itself and its para- 
metric representation. The above expressions for the derivative 
of the length of arc with respect to the parameter are usually 
expressed symbolically by omitting the reference to the parameter 
t and saying that the “line element ” ds on the surface is given 
by the “ quadratic differential form ” 


ds? = Edu? +- 2F dudv + Gd. 


For the direction cosines of the normal to a surface given 
in the form (2, y, z)= 0 we have already obtained (p. 130) 
the expressions 


cosa = erat eee cos B = i 
VOFTOF+ O3) VOF+ OFF OB) 
®, 
or MOET OFT OF 


To obtain these direction cosines in the case of parametric re- 
presentation, we suppose that the surface given by the equations 
c= d¢(u, v), y= Pu, v), = x(u, v) is written in the form 
@(x, y, 2) = 0. The equation 


O(d(u, v), ou, v), x(u, v)) = 0 


is then an identity in wu and v, and by differentiation we 
obtain 


®, 4, + Od, + ®, Xu = 0, 
©, ¢, + O, 4, + ®,x%, = 0. 


From these it follows at once that (cf. Chap. I, section 3, p. 26) 


Dy = pluxe — Xue); Py = p(Xuho — Puxe)s 
®, = p(dupy — Puho)s 


where p is a suitably chosen multiplier. From the definition of 
E, F, G we find by direct expansion that 


(buXv — Xuibo)® + (xubo — Puxe)® + (Putho — Yubo)? = EG — FF, 
and combining this with the preceding equation, we have 
O24 O24 02 = p(EG— FP, 
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Thus we finally obtain the formule for the direction cosines of 
the normal to the surface in the form 


COs a, = PuXe— Xubo cos B = XuPo — $uXe 


/(EG — Fy’  A/(EG — FY 
uy due ue dy dy 
cosy = V(EG — F) 


The equations «= g(t), v = A(t), as we have seen, represent 
@ curve on the surface. The direction cosines of the tangent to 
this curve are given according to the chain rule by the expressions 


jon a ee 
ds dtds 4+/(Hu®+ 2Fu'v' + Gv’) 
= Yt! + yy’ me zyw’ + 2,0 
008B= Tait aFue pay °°’ Wiwap aru tau) 
Here for brevity we have put ~~ ao) = oe Ve v’. If we now 


consider a second curve on the cee given by the equations 
u= g,(t), v= h,(t), whose tangent has the direction cosines cos a,, 
cos B,, cosy,, and if we use the abbreviations 


dgi(t)_,  Aa(t) _ 
Sane ae 


then the cosine of the angle between the two curves is given by the 
cosine of the angle between their tangents, that is, by 


CO8w = cosa cosa, + cos B cos B, + cosy cosy, 
Bui’ + Fav’ + u's) + Gin’ 
ery (Eu? + 2F is +- Go?) 4/(Eu’? + 2Fu'e’ + Gv") 


where all the quantities on the right are to be given the values 
which they have at the point of intersection of the two curves. 

In particular, we may consider those curves on the surface 
which are given by equations w= const. or v = const. If in our 
parametric representation we substitute a definite fixed value 
for u, we obtain a three-dimensional or twisted curve lying on 
the surface and having v as parameter; and a corresponding 
statement holds good if we substitute a tixed value for v and 
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allow u to vary. These curves «= const. and v = const. are the 
parametric curves on the surface. The net of parametric curves 
corresponds to the net of par- 
allels to the axes in the wv-plane 
(fig. 17). 

The mapping of one plane 
region on another may be re- 
garded as a special case of 
parametric representation. For 
if the third of our functions 
x(u, ¥) vanishes for all values of 
uw and v under consideration, then 
as the point (w, v) ranges over its 
given region the point (a, y, z) 
will range over a region in the 
zy-plane. Hence our equations 
merely represent the mapping of 
a region of the uv-plane on a region of the xy-plane; or if we 
prefer to think in terms of transformations of co-ordinates, the 
equations define a system of curvilinear co-ordinates in the wv- 
region, and the inverse functions (if they exist) define a curvi- 
linear wo-system of co-ordinates in the plane zy-region. In terms 
of the curvilinear co-ordinates (u, v) the line element in the 
zy-plane is simply 

ds? = Bdv? + 2Fdudw + Gd, 


ox\? ox\2 
Pra[2 in 
a) 7 a) 

__ On Ox oy oy 
ade Fe a Ae 


_ [ea\? oy\? 


As a further example of the representation of a surface in parametric 
form we consider the anchor ring or torus. This is obtained by rotating a 
circle about a line which lies in the plane of the circle and does not intersect 
it (cf. fig. 18). If we take this axis of rotation as the z-axis and choose the 
y-axis in such a way that it passes through the centre of the circle, whose 
y-co-ordinate we denote by a, and if the radius of the circle is r <<{a|, 
we obtain in the first instance 


z=0, y—a=rcos0, z=rsin@ (0 S60 < 2r) 


Fig. 17.—Parametric curves u = const., 
v = const. 


where 
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as @ parametric representation of the circle in the yz-plane. Now letting 
the circle rotate about the z-axis, we find that for each point of the circle 
x? - y? remains constant, that is, 
e+ y= (a+rcos6)?, Thus if 
the angle of rotation about the 
z-axis is denoted by » we have 


Z 


x= (a + 7 cos) sing, 

y = (a+ 7 cos) cosq, 
OSqo<2n 

z= rsin®@ 0S6< 22 


as a parametric representation of 

the anchor ring in terms of the 

parameters 9 and 9. In this re- 

presentation the anchor ring ap- 

Fig. 18.—Generation of an anchor ring by Pears as the image of a square of 

the rotation of a circle side 2x in the 0@-plane, where any 

pair of boundary points lying on 

the same line 6 = const. or » = const. corresponds to only one point 

on the surface, and the four corners of the square all correspond to the 
same point. 

For the line element on the anchor ring we have 


ds? = 72.6? + (a + 7 cos6)*? dg’, 


2. Conformal Representation in General. 


A transformation 
f= H(z, Y)s y= P(x, y) 


is called a conformal transformation if any two curves are trans- 
formed by it into two others which make the same angle with 
each other as the original ones do. 

Theorem.—A necessary and sufficient condition that our (con- 
tinuously differentiable) transformation should be conformal is 
that the Cauchy-Riemann equations 


bu — Py = 0, dy t+ $a= 0 


da + py = 0, dy — tz = 0 


hold. In the first case the direction of the angles is preserved, 
in the second case the direction is reversed.* 
Proof.—We assume that the transformation is conformal. 


or 


* This last statement follows directly from the statements on p. 151 con- 
cerning the sign of the Jacobian ¢zvby — dy pe 
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Then the two orthogonal curves € = const., 7 = const. in the 
éy-plane must correspond to orthogonal curves (x, y) = const. 
and (2, y) = const. in the zy-plane. 

Hence from the formula for the angle between two curves 
(p. 126) it follows immediately that 


bu%e + dyty = 0. 


In the same way, the curves corresponding to £ + » = const. 
and €— 7 = const. must be orthogonal. This gives 


(de + bebe a 1) + (by =P ty)(dy = by) = 0, 
and therefore 
a + of = pa = 7. 


The first of our equations can be written in the form 


ba = Ay, by = —Afes 
where A denotes a constant of proportionality. Introducing this 
in the second equation, we immediately get A2 = 1, so that one 
or other of our two systems of Cauchy-Riemann equations holds. 

That the equations are a sufficient condition is confirmed by 
the following remark: 

If two curves in the zy-plane are given by equations 
F(x, y) = 0, G(x, y) = 0 and if according to our transformation 
F(a, y) = O(€, n), G(x, y) =T(E, n), then by using the Cauchy- 
Riemann equations we readily obtain 


FP + F2= (02 + 9,)(¢27 + $,7), 
G2 + G? = (TU? + T,?)(?." + ¢$,’), 
F On +F vy = (0,1; + OT)" ++ $y”); 
therefore 
FG: + F,G, no Ge oP; ot et ae 
V (Poo + PPV G2 + G7) VOPR + ©?)/(P2 + Py 
That is, the curves F=0, G=0 and their images O= 0, 
TI = 0 make the same angle with each other. 


EXAMPLES 
1. (a) Prove that the stereographic projection of the unit sphere on 
the plane is conformal. 


(6) Prove that circles on the sphere are transformed either into circles 
or into straight lines in the plane. 
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‘(c) Prove that in stereographic projection reflection of the spherical 
surface in the equatorial plane corresponds to an inversion in the uv-plane. 


(d) Find the expression for the line element on the sphere in terms of 
the parameters wu, v 


2. Calculate the line element 
(a) on the sphere 
z= cosu sin, y = sinu sinv, 2 = cosy; 
(b) on the hyperboloid 
x= cosu coshv, y = sinw cosh», z = sinh»; 
(c) on a surface of revolution given by 
r= V(x9 + ¥*) = f(z), 


using the cylindrical co-ordinates z and 6 = arc tan 2 as co-ordinates on 
the surface; 


(d)* on the quadric t= const. of the family of confocal quadrics given by 
SL alae aes aie ORC D 


a—t b-—t c-—t 


using ¢, and ¢, as co-ordinates on the quadric (cf. Ex. 6, p. 158). 
3. Prove that if a new system of curvilinear co-ordinates 7, 8 is intro- 
duced on a surface with parameters u, v by means of the equations 


u= Ur, 8), v= wr, 8), 
then 


HG’ — F? = (BG — aes vale 
ar, 8) 


where E’ , F’, @’ denote the fundamental quantities taken with respect to 
r,s and #, F, G those taken with respect to u, v. 


4, Let ¢ be a tangent to a surface S at the point P, and consider the 
sections of S made by all planes containing t. Prove that the centres of 
curvature of the different sections lie on a circle. 


5. If tis a tangent to the surface S at the point P, we call the curvature 
of the normal plane section through ¢ (i.e. the section through ¢ and the 
normal) at that point the “‘ curvature (&) of S in the direction ¢”. For 
every tangent at P we take the vector with the direction of i, initial point P, 


and length a Prove that the final points of these vectors lie on a conic, 


6*, A curve is given as the intersection of the two surfaces 
M+ yt eo l 
ax* + by? + cz? = 0. 
Find the equations of 
(a) the tangent, 
(6) the osculating plane, at any point of the curve. 
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5. Famities oF Curves, Famiiizrs oF SURFACES, AND 
THEIR ENVELOPES 


1. General Remarks. 


‘On various occasions we have already considered curves or 
surfaces not as individual configurations, but as members of a 
family of curves or surfaces, such as f(z, y) = c, where to each 
value of ¢ there corresponds a different curve of the family. 

For example, the lines parallel to the y-axis in the zy-plane, that is, the 
lines = c, form a family of curves. The same is true for the family of 
concentric circles 2* +- y* = c* about the origin; to each value of ¢ there 
corresponds a circle of the family, namely the circle with radius c. Similarly, 
the rectangular hyperbolas zy = c form a family of curves, sketched in fig. 2, 
p. 113. The particular value c = 0 corresponds to the degenerate hyperbola 
consisting of the two co-ordinate axes. Another example of a family of 
curves is the set of all the normals to a given curve. If the curve is given 
in terms of the parameter ¢ by the equations & = 9(t), n= (#), we obtain 
the equation of the family of normals in the form 


(z — 9(4)) ot) + (y— HO) Y') = 9, 
where ¢ is used instead of ¢ to denote the parameter of the family. 


The general concept of a family of curves can be expressed 
analytically in the following way. Let 


S (a, y, ¢) 


be a continuously differentiable function of the two independent 
variables x and y and of the parameter c, this parameter varying 
in a given interval. (Thus the parameter is really a third indepen- 
dent variable, which is lettered differently simply because it plays 
a different part.) Then if the equation 


f(z, y, 0) = 9 


for each value of the parameter c represents a curve, the aggregate 
of the curves obtained as c describes its interval is called a family 
of curves depending on the parameter c. 

The curves of such a family may also be represented in para- 
metric form by means of a parameter t of the curve, in the form 


c= H(t, ¢), io v(t, ¢), 


where c is again the parameter of the family. If we assign ca 
7e (£912) 
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fixed value, these equations represent a curve with the parameter ¢. 
For example, the equations 
x=ccost, y=csint 
represent the family of concentric circles mentioned above; again, the 


equations 1 


“= cl, Y=% 


represent the family of rectangular hyperbolas mentioned above, except 
for the degenerate hyperbola consisting of the co-ordinate axes. 


Occasionally we are led to consider families of curves which 
depend not on one parameter but on several parameters. For 
example, the aggregate of all circles (x — a)? +- (y— bP? =e 
in the plane is a family of curves depending on the three para- 
meters a, b, c. If nothing is said to the contrary, we shall always 
understand a family of curves to be a “ one-parameter ” family, 
depending on a single parameter. The other cases we shall dis- 
tinguish by speaking of two-parameter, three-parameter, or multi- 
parameter families of curves. 

Similar statements of course hold for families of surfaces in 
space. If we are given a continuously differentiable function 
f(z, y, 2, ¢), and if for each value of the parameter c in a certain 
definite interval the equation 


S(@, y, 2 ©) = 90 


represents a surface in the space with rectangular co-ordinates 
Z, y, 2, then the aggregate of the surfaces obtained by letting c 
describe its interval is called a family of surfaces, or, more precisely, 
@ one-parameter family of surfaces with the parameter c. For 
example, the spheres z*-+ y2-+ 22 = c¢? about the origin form 
such a family. As with curves, we can also consider families of 
surfaces depending on several parameters. 


Thus the planes defined by the equation 
az + by+Vi—@— 8z+1=0 


form a two-parameter family, depending on the parameters @ and 3, if 
the parameters a and 6 range over the region a? + 62 <1. This family of 
surfaces consists of the class of all planes which are at unit distance froni 
the origin.* 


* Sometimes a one-parametric family of surfaces is referred to as oo! surfaces, 
a two-parametric family as 0? surfaces, and so on. 
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2. Envelopes of One-Parameter Families of Curves. 


If a family of straight lines is identical with the aggregate of 
the tangents to a plane curve H—as e.g. the family of normals of 
a curve C is identical with the family of tangents to the evolute 
E of C (cf. Vol. I, p. 308)—we shall say that the curve £ is the 
envelope of the family of lines. In the same way we shall say that 
the family of circles with radius 1 and centre on the z-axis, that 
is, the family of circles with the equation (« — c)*+ y2—1= 0, 
has the pair of lines y= 1 and y= —1, which touch each of 


(es 


Fig. 19.—Family of circles with envelope 


the circles, as its envelope (fig. 19). In these cases we can obtain 
the point of contact of the envelope and the curve of the family 
by finding the intersection of two curves of the family with 
parameter values c and c++ A and then letting h tend to zero. 
We may express this briefly by saying that the envelope is the 
locus of the intersections of neighbouring curves. 

With other families of curves it may again happen that a 
curve FE exists which at each of its points touches some one of the 
curves of the family, the particular curve depending of course 
on the point of # in question. We then call H the envelope of 
the family of curves. The question now arises of finding the 
envelope E of a given family of curves f(x, y, c) = 0. We first 
make a few plausible remarks, in which we assume that an 
envelope # does exist and that it can be obtained, as in the 
above cases, as the locus of the intersections of neighbouring 
curves.* We then obtain the point of contact of the curve 


* Since this last assumption will be shown by examples to be too restrictive, 
we shall shortly replace these plausibilities by a more complete discussion. 
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f(x, y, ¢) = 0 with the curve F in the following way. In addition 
to this curve we consider a neighbouring curve f(z, y, ¢ + h) = 0, 
find the intersection of these two curves, and then let h tend to 
zero. The point of intersection must then approach the point 
of contact sought. At the point of intersection the equation 


S (2, y, ¢-+ h) — f(a, ¥y; ¢) _9 
h 


is true as well as the equations f(x, y,c-+ h) = Oand f(z, y, c) =0 
In the first equation we perform the passage to the limit h > 0. 
Since we have assumed the existence of the partial derivative /f,, 
this gives the two equations 


Sey, c)=0, f(z, y, c) =0 


for the point of contact of the curve f(z, y, c)=0 with the 
envelope. If we can determine x and y as functions of c by means 
of these equations, we obtain the parametric representation of a 
curve with the parameter c, and this curve is the envelope. By 
elimination of the parameter c it can also be represented in the 
form g(x, y) = 0. This equation is called the “ discriminant ” of 
the family, and the curve given by the equation g(x, y) = 0 is 
called the “ discriminant curve ”. 

We are thus led to the following rule: im order to obtain the 
envelope of a family of curves f(x, y, c) = 0, we consider the two 
equations f(x, y, c)=0 and f,(x, y, c) = 0 simultaneously and 
attempt to express x and y as functions of c by means of them or to 
eliminate the quantity c between them. 

We shall now replace the above heuristic considerations by a 
more complete and more general discussion, based on the definition 
of the envelope as the curve of contact. At the same time we shall 
learn under what conditions our rule actually does give the 
envelope, and what other possibilities present themselves. 

We assume to begin with that # is an envelope which can be 
represented in terms of the parameter c by two continuously 
differentiable functions 


L= Ac), y= y(c), 
2 2 
where *) + (¢ =+ 0, and which at the point with para- 


meter c touches the curve of the family with the same value of the 
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parameter c. In the first place, the equation f(z, y, c) = 0 is 
satisfied at the point of contact. If in this equation we substitute 
the expressions z(c) and y(c) for x and y, it remains valid for all 
values of ¢ in the interval. On differentiating with respect to ¢ 
we at once obtain 


dx dy 
= = = 0. 
Ss dc +f, dc +f 
Now the condition of tangency is 
dx dy__o. 
Ieig t qe 


for the quantities dx/dc and dy/de are proportional to the direction 
cosines of the tangent to H and the quantities f, and f, are pro- 
portional to the direction cosines of the normal to the curve 
f(z, y, ¢) = 0 of the family, and these directions must be at right 
angles to one another. It follows that the envelope satisfies the 
equation f, = 0, and we thus see that the rule given above is a 
necessary condition for the envelope. 

In order to find out how far this condition is also sufficient, 
we assume that a curve EZ represented by two continuously dif- 
ferentiable functions += 2(c) and y= (ec) satisfies the two 
equations f(x, y, c) = 0 and f,(z, y, c) = 0. In the first equation 
we again substitute z(c) and y(c) for x and y; this equation then 
becomes an identity in c. If we differentiate with respect to c 
and remember that f, = 0, we at once obtain the relation 


da, , dy_ 
fer tha =% 


which therefore holds for all points of Z. If the two expressions 
foe+ fi and (da/dc)? + (dy/dc)? both differ from zero at a point 
of #, so that at that point both the curve F and the curve of the 
family have well-defined tangents, this equation states that the 
envelope and the curve of the family touch one another. With 
these additional assumptions our rule is a sufficient condition for 
the envelope as well as a necessary one. If, however, f, and f, 
both vanish, the curve of the family may have a singular point 
(cf. section 2, p. 128), and we can draw no conclusions about 
the contact of the curves. 

Thus after we have found the discriminant curve it is still 
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necessary to make a further investigation in each case, in order 
to discover whether it is really an envelope or to what extent 
it fails to be one. 

In conclusion we state the condition for the discriminant 
curve of a family of curves given in parametric form 


r= g(t,c), y= H(t, e), 
with the curve parameter ¢. This is 


bie — tehi = 0. 


We can readily obtain it e.g. if we pass from the parametric 
representation of the family to the original expression by elimina- 
tion of ¢. 


3. Examples. 


l. (7 — c+ y2=1. As we have seen on p. 171, this equation re- 
presents the family of circles of unit radius whose centres lie on the x-axis 
(fig. 19). Geometrically we see at once that the envelope must consist of 
the two lines y= 1 and y= —1. We can verify this by means of our 
rule; for the two equations (x —c)?+ y2=1 and —2(z— c)= 0 im- 
mediately give us the envelope in the form y* = 1. 


y 


C; Cp Cs C4 Cs 


Fig. 20.—Family of parabolas with envelope 


2. The family of circles of unit radius passing through the origin, 
whose centres, therefore, must lie on the circle of unit radius about the 
origin, is given by the equation 

(% — cosc)? + (y — sinc)? = 1 


or 
a + y? — 2x cose — 2y sine = 0. 


The derivative with respect to c equated to zero gives x sinc — y cosc = 0, 
These two equations are satisfied by the values x=0 and y= 0. 
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If, however, 2° -++ y? + 0, it readily follows from our equations that 
sinc = y/2, cosc = 2/2, so that on eliminating ¢ we obtain z* + y* = 4. 
Thus for the envelope our rule gives us the circle of radius 2 about the 
origin, as is anticipated by geometrical intuition; but it also gives us the 
isolated point z= 0, y= 0. 

3. The family of parabolas (x — c)* — 2y = 0 (cf. fig. 20) also has an 
envelope, which both by intuition and by our rule is found to be the x-axis. 


4, We next consider the family of circles (@ — 2c)?+y2—=0 


Fig. 21.—The family (x — 2c)? + y? —c¥ = 0 


(cf. fig. 21). Differentiation with respect to ¢ gives 2x — 3c = 0, and by 
substitution we find that the equation of the envelope is 


an 
¥= 33 


that is, the envelope consists of the two lines y = Fs zand y= — a x. 
The origin is an exception, in that contact does not occur there. 

5. Another example is the family of straight lines on which unit length 
is intercepted by the z- and y-axes. If «= is the angle indicated in 
fig. 22, these lines are given by the equation 


oe Need, 
cose sing 
The condition for the envelope is 
sin a xy — 008% 
cos? sin?a 


y=0, 


which, in conjunction with the equation of the lines, gives the envelope in 
parametric form, 
z= costa, y= sina. 
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Fig. 22.—Are of the astroid as envelope of straight lines 


From these we obtain the further equation 
8 4 yl = 1, 


This curve is called the astroid (cf. Vol. 1, Chap. V, Ex. 6, p. 267). It 
consists (figs. 23, 24) of four symmetrical branches meeting in four cusps. 


Fig. 23.—Astroid Fig. 24.--Astroid as envelope of ellipses 


6. The astroid 2°" + y= 1 also appears as the envelope of the 
family of ellipses 
Eo y 


@'a—e 


whose semi-axes c and (1 — c) have the constant sum 1 (fig. 24). 
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7. The family of curves (x — c)? — y® = 0 shows that in certain cir. 
cumstances our process may fail to give an envelope. Here the rule gives 


y 


[_—________» % 


Cy C2 0 3 Cy Cs 
Fig. 25.—The family (x — c)*? — 9° = 0 


the z-axis. But, as fig. 25 shows, this is not an envelope; it is the locus 
of the cusps of the curves of the family. 


8. In the case of the family 


(e— of —y*=0 
we again find that the discriminant curve is the z-axis (cf. fig. 26). This 
y 


Fig. 26.—The family (x — c)}*§— 77 = 0 


is again the cusp-locus; but it touches each of the curves, and in this 
sense must be regarded as the envelope. 


9. Another example, in which the discriminant curve consists of the 
envelope plus the locus of the double points, is given by the family of 


ercuhane [8 + (y—e)'](c— 2) +2=0 
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(ef. fig. 27). All the curves of the family are similar to each other and 
arise from one another by translation parallel to the y-axis. By differen- 
tiation we obtain f, = —2(y — c)(# — 2) = 0, 
so that we must have either = 2 or y=. 
The line x = 2 does not enter into the matter, 
however, for no finite value of y corresponds 
to «= 2. We therefore have y= c, so that 
the discriminant curve is 2°(2 — 2)+ z= 0. 
This curve consists of the two straight lines 
z=0 and «=1. As we see from fig. 27, 
only x= 0 is the envelope; the line x= 1 
passes through the double points of the curves. 


Fig. 27.— Family of strophoids Fig. 28.— Family of cubica! parabolas 


10. The envelope need not be the locus of the points of intersection 
of neighbouring curves; this is shown by the family of identical parallel 
cubical parabolas y — (x — c)® = 0. No two of these curves intersect 
each other. The rule gives the equation f, = 3(a% — c)? = 0, so that the 
x-axis y = 0 is the discriminant curve. Since all the curves of the family 
are touched by it, it is also the envelope (fig. 28). 


11. The notion of the envelope enables us to give a new definition for 
the evolute of a curve C (cf. Vol. I, pp. 283, 307 e¢ seg.). Let C be given 
by « = 9(t), y = $(t). We then define the evolute # of C as the envelope 
of the normals of C. As the normals of C are given by 


{x — o(t)} et) + fy — HOV = 0, 
the envelope is found by differentiating this equation with respect to ?: 
0 = {x — oft} et) + {y — UE) OQ — o%(t) — YZ). 


From this equation and the preceding one we obtain the parametric re- 
presentation of the envelope, 


ny P2+ 02 b’e 
= _ t) ————— = ceuelley ay arr wT 4 
z= o(t)— ¥') Veo ° Vette 

, et + y”? ee 
= t = > 
¥y 70) + ¢( ) yo’ ot’ p a1 V (9% + py’) 

where 
(p? + ¥?) 


yeaa 
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denotes the radius of curvature (cf. Vol. I, p. 281). These equations are 
identical with those given in Vol I, p. 283 for the evolute. 


12. Let a curve C be given by = o(t), y = Y(t). We form the envelope 
E of the circles having their centres on C and passing through the origin O. 
Since the circles are given by 


2 -+ y* — 2xo(t) — 2yp(t) = 0, 


the equation of 2 is 
x@’(t) + yp(é) = 0. 


Hence if P is the point (9(t), }(é)) and Q(z, y) the corresponding point 
of #, then OQ is perpendicular to the tangent to C at P. Since by definition 
PQ = PO, PO and PQ make equal angles with the tangent to C at P. 

If we imagine O to be a luminous point and C a reflecting curve, then 
QP is the reflected ray corresponding to OP. The envelope of the reflected 
rays is called the caustic of C with respect to O. The caustic is the evolute 
of H. For the reflected ray PQ is normal to Z, since a circle with centre 
P touches # at Q, and the envelope of the normals of £ is its evolute, as 
we saw in the preceding example. 

For example, let C be a circle passing through O. Then F is the path 
described by the point O’ of a circle C’ congruent to C which rolls on C 
and starts with O and O’ coincident. For during the motion O and O’ 
always occupy symmetrical positions with respect to the common tangent 
of the two circles. Thus Z will be a special epicycloid, in fact, a cardioid 
(cf. Vol. I, p. 267, Ex. 2 and 3). As the evolute of an epicycloid is a similar 
epicycloid (cf. Vol. I, p. 311, Ex. 1), the caustic of C with respect to O is in 
this case a cardioid. 


4. Envelopes of Families of Surfaces. 


The remarks made about the envelopes of families of curves 
apply with but little alteration to families of surfaces also. If 
in the first instance we consider a one-parameter family of surfaces 
S(%, y, 2, ¢)= 0 in a definite interval of parameter values c, we shall 
say that a surface # is the envelope of the family if it touches 
each surface of the family along a whole curve, and if further 
these curves of contact form a one-parameter family of curves on 
& which completely cover 2. 


An example is given by the family of all spheres of unit radius with 
centres on the z-axis. We see intuitively that the envelope is the cylinder 
a? + y? — 1= 0 with unit radius and axis along the z-axis; the family 
of curves of contact is simply the family of circles parallel to the xy-plane, 
with unit radius and centre on the z-axis.* 


* The envelopes of spheres of constant radius whose centres lie along curves 
are called tube-surfaces. 
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As in sub-section 2 (p. 172), if we assume that the envelope 
does exist we can find it by the following heuristic method. We 
first consider the surfaces f(z, y, z,c)=0 and f(a, y, z,e+h)=0 
corresponding to two different parameter values c and ¢-+- h. 
These two equations determine the curve of intersection of the 
two surfaces (we expressly assume that such a curve of inter- 
section exists). In addition to the two equations above, this 
curve also satisfies the third equation 


fy 46+ h—-fOGY4)_ 9 
Coe i sa 


If we let tend to zero, the curve of intersection will approach a 
definite limiting position, and this limit curve is determined by 
the two equations 


St (a, Y, %, c) = 0, SFA&, Y; 2, c) = 0. 


This curve is often referred to in a non-rigorous but intuitive 
way as the intersection of “neighbouring” surfaces of the 
family. It is still a function of the parameter c, so that all the 
curves of intersection for the different values of c form a one- 
parameter family of curves in space. If we eliminate the quantity 
e from the two equations above we obtain an equation, which 
is called the “discriminant ”. As in sub-section 2 (p. 172), we 
can show that the envelope must satisfy this discriminant 
equation. 

Just as in the case of plane curves, we may readily convince 
ourselves that a plane touching the discriminant surface also 
touches the corresponding surface of the family, provided that 
Je+f7+ 2+ 0. Hence the discriminant surface again gives 
the envelopes of the family and the loci of the singularities of 
the surfaces of the family. 


As a first example we consider the family of spheres 
a+ y+ (2— of —1=0 
mentioned above. To find the envelope we have the additional equation 
—2(z —c) = 0. 


For fixed values of ¢ these two equations obviously represent the circle 
of unit radius parallel to the zy-plane at the height z = ¢. If we eliminate 
the parameter ¢ between the two equations, we obtain the equation of the 
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envelope in the form 2?-+ y*—1= 0, which is the equation of the 
right circular cylinder with unit radius and the z-axis as axis. 


While for families of curves the formation of the envelope 
has a meaning only for one-parameter families, in the case of 
families of surfaces it is also possible to find envelopes of two- 
parameter families f(x, y, 2, ¢, C,)=0. If, for example, we consider 
the family of all spheres with unit radius and centre on the 
ay-plane, represented by the equation 


(2 — 4)? + (y—@)? + 2#—1=9, 


intuition at once tells us that the two planes z= 1 and z= —1 
touch a surface of the family at every point. In general we shall 
say that a surface HZ is the envelope of a two-parameter family 
of surfaces if at every point P of E the surface E touches a surface 
of the family in such a way that as P ranges over E the parameter 
values ¢,, C, corresponding to the surface touching E at P range 
over a region of the ¢,c,-plane, and in addition different points 
(c, Cg) correspond to different points P of E. A surface of the 
family then touches the envelope in a pam, and not, as before, 
along a whole curve. 

With assumptions similar to those made in the case of plane 
curves, we find that the point of contact of a surface of the family 
with the envelope, if it exists, must satisfy the equations 


S(%, Y, 2, Cy Co) = 0,  fe,(a, Y, 2 Cy, Co) = 0, f,(@, Ys % Cr Co) = O. 


From these three equations we can in general find the 
point of contact of each separate surface by assigning the corre- 
sponding values to the parameters. If, conversely, we eliminate 
the parameters c, and c,, we obtain an equation which the en- 
velope must satisfy. 

For example, the family of spheres with unit radius and centre on the 
xy-plane is given by the equation 

IY % Cy ) = (2 — GP + (Y— GP 4+ A—1=0 
with the two parameters ¢, and c,. The rule for forming the envelope 
gives the two equations 

fe, = —Ue—e)=0 and f= —Ay— ) = 0. 
Thus for the discriminant equation we have z* — 1 = 0, and in fact the 


two planes z= 1 and z= —1 are envelopes, as we have already seen 
intuitively. 
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ExAaMPLes 


1, Let z = u(x, y) be the equation of a tube-surface, i.e. the envelope 
of a family of spheres of unit radius with their centres on some curve 
y = f(x) in the ay-plane. Prove that 


wu? + uy? + 1)= 1. 


2. (a) Find the envelope of the two-parameter family of planes for 


which 
OP + OQ + OR = const. = 1, 


where P, Q, R denote the points of intersection of the planes with the 
co-ordinate axes and O the origin. 
(b) Find the envelope of the planes for which 


OP? + OQ? + OR? = 1. 


3. Let C be an arbitrary curve in the plane, and consider the circles 
of radius p whose centres lie on C. Prove that the envelope of these circles 
is formed by the two curves parallel to C at the distance p (cf. the 
definition of parallel curves, Vol. I, p. 291). 

4*, A family of straight lines in space may be given as the intersection 
of two planes depending on a parameter ft: 


a(t)e + B(é)y + e{t)z2 = 1 
Ut)e + ety + fiz = 1. 


Prove that if these straight lines are tangents to some curve, i.e. possess 
an envelope, then ; 
a—d b—e c—f 
a’ bv’ c|=0. 


5*, A family of planes is given by 
acost + ysint+ z= 4, 


where ¢ is a parameter. 

(a) Find the equation of the envelope of the planes in cylindrical co- 
ordinates (7, z, 8). 

(6) Prove that the envelope consists of the tangents to a certain 
curve. 

6. If a body is always thrown from the same initial position with the 
same initial velocity but at different angles, its trajectories form a family 
of parabolas (it is assumed that the motion always takes place in the same 
vertical plane). Prove that the envelope of these parabolas is another 
parabola. 

7*, Find the envelope of the family of spheres which touch the three 
spheres 
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8: (@— 38+ y+2=—% 
S,: z+ (y — 2R4+2= 2, 
Sy a+ y+ (2 — ZF = 
8. If a plane curve C is given by x= f(t), y = g(t), ita “ polar re- 
ciprocal” C’ is defined as the envelope of the family of straight lines 
Ef + ng) =1L 
where (&, 4) are current co-ordinates. 
(a) Prove that C is the polar reciprocal of C’ also. 
(b) Find the polar reciprocal of the circle 


(e— a? + (y— bP =1. 
(c) Find the polar reciprocal of the ellipse 
a ee 


a? 0b? 


_ 6 Maxima anp Minima 


1. Necessary Conditions. 


The theory of maxima and minima for functions of several 
variables, like that for functions of a single variable, forms one 
of the most important applications of differentiation. 

We shall begin by considering a function u = f(x, y) of two 
independent variables x, y, which we shall represent by a surface 
in ayu-space. We say that this surface has a maximum with the 
co-ordinates (2%, yo) if all the other values of u in a neighbour- 
hood of that point (all round the point) are less than u(zp, Yo). 
Geometrically, such a maximum corresponds to a “ hill-top”’ on 
the surface. In the same way, we shall call the point (1, yo) a 
minimum if all other values of the function in a certain neigh- 
bourhood of P(x», yo) are greater than uy = u(%q, Yo). Just as 
with functions of one variable, these concepts always refer only 
to a sufficiently small neighbourhood of the point in question. 
Considered as a whole, the surface may very well have points 
which are higher than the hill-tops. Analytically, we formulate 
our definition as follows, so that it applies to functions of more 
than two independent variables: 

A function u= f(x, y, ..-) has a maximum (or a minimum) 
at the point (Xo, Yo, -- .) of at every point in a neighbourhood of 
(Xo Yo: - - -) the function assumes a smaller value (or a larger 
value) than at the point itself. 
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If in the neighbourhood of (2, yo, . . .) the function assumes 
values which are not greater than the value of the function at the 
point (but may be equal to it), we say that the function has an 
improper maximum at the point. We define an umproper minimum 
in a similar way. 


We again emphasize that this definition refera to a suitably chosen 
neighbourhood of the point, extending in all directions about the point. 
Thus in a closed region the value of a maximum may very well lie below 
the greatest value assumed by the function in the region.* If the greatest 
value is reached at a point P, of the boundary, it need not be a maxi- 
mum in the sense defined above, as we have already seen for functions of 
one variable. For if the function is defined in the closed region only, we 
cannot find a complete neighbourhood of P, in which the function is 
defined; and if, on the other hand, the closed region is contained in a larger 
region in which the function is defined, then in this larger region the 
function may not have a maximum at P,, as the following example shows. 
The function w= —2a— y is defined over the whole ay-plane, but we 
consider it only in the square 0 <2 <1,0S y <1. In this closed region 
it reaches its greatest value 0 at the origin. This greatest value, however, 
is not a maximum. For if we consider a neighbourhood all round the 
origin, we find that the function assumes values greater than zero. If, 
however, we know that the greatest or least value of a function. is 
assumed at a single point interior to the region, that point must necessarily 
be a maximum or a minimum in the sense defined above. 


We shall first give necessary conditions for the occurrence of 
an extreme value. (As in the case of functions of one variable, we 
use the terms t extreme value, extreme point when we do not wish 
to distinguish between maxima and minima.) That is, we find 
conditions which must be satisfied at a point (29, yo, . . .) if there 
is to be an extreme value at that point. The equations 


Fa(Xq; Yo 2o - + .) = 0, 
SulZo, Yo: 20 - + -) = 9, 
FA%o; Yo: Zo, » + ) = 0, 


° . . . . . . . 


are necessary conditions for the occurrence of a maximum or mini- 
mum of a differentiable function u = £(x, y, z, .. .) at the point Py 
with co-ordinates (Xo, Yo; Zp, . « -)- 


* We already know (cf. p. 97) that a continuous function always assumes 
@ greatest and a least value in a closed region. 

f On the other hand, as will be seen later (p. 186), the terms stationary value, 
stationary point include points which are neither maxima nor minima. 
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In fact, these conditions follow at once from the known 
conditions for functions of one independent variable. If we 
consider the variables y, z, ... as fixed at the values Yo, 2; . - + 
and regard the function in the neighbourhood of P, as a function 
of the single variable x, this function of x must have an extreme 
value at the point x= 2, and by our previous results we must 
have f,(Xo, Yo, 2, - - -) = 0. 

Geometrically, the vanishing of the partial derivatives in the case of 


functions of two independent variables means that at the point (29, Yo) 
the tangent plane to the surface u = f(z, y) is parallel to the xy-plane. 


For many purposes it is more convenient to combine the 
conditions in one equation. This equation is 


af (xo; Yo: 2 + - -) = f,(Xp, Yo: 2 + - .) da + filo, Yo 20 + + .)dy 
+ flo, Yor % +. )d2+...= 0. 
In words: at an extreme point the differential (linear part of the 


increment) of the function must vanish, no matter what values 
we assign to the differentials dx, dy, dz, ... of the independent 


variables 2, y, z,.... Conversely, if the above equation is satis- 
fied for arbitrary values of dx, dy, . . . it follows that at the given 
point f, = f,=...=0. We have only to take all but one of 


the (mutually independent) variables equal to zero. 
In the equations 


SelXq; Yo 2 + - ) as 0, 
Silos Yos 2, - - -) = 9, 
SA®o: Yor 2 - » -) = 9, 


there are as many unknowns Zp, Yo, 2%, . - . a8 there are equations. 
As a rule, therefore, we can calculate the position of the extreme 
points by means of them. But a point obtained in this way need 
not by any means be an extreme point. 


We consider e.g. the function w= zy. Our two equations at once give 
x =.0,¥= 0. In the neighbourhood of the point z = 0, y = 0, however, 
the function assumes both positive and negative values, according to the 
quadrant. The function therefore has not an extreme value there. The 
geometrical representation of the surface w= zy, which is a hyperbolic 
paraboloid, shows that the origin is a saddle point (cf. fig. 1, p. 112). 


It is useful to have a simple expression for a point at which 
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the above equations are satisfied, irrespective of whether the 
function has an extreme point or not. We accordingly say that 
if there is a point (Xp, Yo, 29, . . -) at which f, = 0, f, = 0, f, = 0, 
..., or at which 


df = f,dx + f,dy +f.de+...=0, 


the function has a stationary value at that (stationary) point 
(cf. footnote, p. 184). 

Every point interior to a closed region at which a differentiable 
function assumes its greatest or its least value is a stationary 
point. 

To decide whether and when our system of equations really 
gives an extreme value, we must make further investigations. 
In many cases, however, the state of affairs is clear from the 
outset, in particular, if we know that the greatest or least value 
of the function must be assumed at an interior point P of the 
region and find that our equations determine only a single 
stationary system 2 = %, Y= Yo, -.-.. This system of values 
must then determine the point P, which is necessarily a stationary 
point. If such considerations do not apply, however, we must 
investigate the matter more closely; this we postpone to the 
appendix to this chapter (p. 204). Meanwhile we shall illustrate 
the foregoing results by means of some examples. 


2. Examples. 


1. For the function wu = 2? + y® the partial derivatives vanish only 
at the origin, so that this point alone can be an extreme point. The function 
actually has a minimum, for at all points (x, y) different from (0, 0) the 
function u = 27+ y* must be positive, being a sum of squares. 


2. The function 
u= V(1— 8 — yy), (+ y? <1) 


has the partial derivatives 
7 y 


eT Joep | UN ay 


and these vanish only at the origin. Here we have a maximum, for at all 
other points (xz, y) in the neighbourhood of the origin the quantity 
1 — 2? — y* under the square root is less than it is at the origin. 

3. We wish to construct the triangle for which the product of the sines 
of the three angles is greatest; that is, we wish to find the maximum of 


the function ; . . 
fl, y) = sinz siny sin(x + y) 
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in the region0 Sa@Sn,0SySr,0Set+ySn. Since f is positive 
in the interior of this region, its greatest value is positive. On the boundary 
of the region, where the equality sign holds in at least one of the in- 
equalities defining the region, we have f(z, y) = 0, so that the greatest 
value must lie in the interior. 

If we equate the derivatives to zero, we obtain the two equations 


cosz siny sin(x + y) + sinz siny cos(z + y) = 0, 
sinz cosy sin(z + y) + sina siny cos(% + y) = 0. 


Since O< a< nr, 0<y<7,0<2+y <7, these give tanz= tany, 
or x== y. If we substitute this value in the first equation, we obtain the 


relation sin3z = 0; hence # = 7 y= 4 is the only stationary point, and 


the required triangle is equilateral. 

4, Three points P,, P,, Ps, with co-ordinates (2,, ¥,), (%2, Ye), and (2, Ys) 
respectively, are the vertices of an acute-angled triangle. We wish to 
find a fourth point P with co-ordinates (x, y) such that the sum of its 
distances from P,, P,, and Ps; is the 
least possible. This sum of distances 
is a continuous function of z and y, 
and at some point P inside a large 
circle enclosing the triangle it has a 
least value. This point P cannot lie 
at a vertex of the triangle, for then 
the foot of the perpendicular from one 
of the other two vertices on to the Fig. 29.-—Three vectors with equal 
opposite side would give a smaller sum magnitudes and sum zero 
of distances. Again, P cannot lie on 
the circumference of the circle, if this is sufficiently far away from the 
triangle. With the distances 


4=Ve— uP t+ y— v? 
we now form the function 
f(@ y= t+ t2+ ty 


which is differentiable everywhere except at P,, P,, and P;. We know 
that at the point P the partial derivatives with respect to « and y must 
vanish. Thus by differentiating f we obtain the conditions 


" Te Ts; 


for P, According to these equations the three plane vectors w,, 2%, 25, 
with components 
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T—-%m ¥—-N t—% Y— Ye t—% Y— Ys 
4°? tT 7 1, * tts 


respectively, have the vector sum 0. Also, these vectors are each of unit 
length. When combined geometrically, then, they form an equilateral 
triangle; that is, each vector is brought into the direction of the next by 
@ rotation through 3x (fig. 29). Since these three vectors have the same 
directions as the three vectors from P,, Ps, P to P, it follows that each of 
the three sides of the triangle must subtend the same angle 2x at the 
point P, 


3. Maxima and Minima with Subsidiary Conditions. 


The problem of determining the maxima and minima of 
functions of several variables frequently presents itself in a 
form differing from that treated above. If e.g. we wish to find the 
point of a given surface ¢(a, y, z) = 0 which is at the least distance 
from the origin, then we have to determine the minimum of the 


function 
I (a, Y, 2) = V(e E y + z*), 


where the quantities x, y, z, however, are no longer three in- 
dependent variables, but are connected by the equation of the 
surface $(x, y, z) = 0 as a subsidiary condition. Such “ maxima 
and minima with subsidiary conditions ” do not, indeed, represent 
a fundamentally new problem. Thus in our example we need only 
solve for one of the variables, say z, in terms of the other two, 
and then substitute this expression in the formula for the distance 
V(x? + y? + 2), to reduce the problem to that of determining 
the stationary values of a function of the two variables 2, y. 

It is, however, more convenient, and also more elegant, to 
express the conditions for a stationary value in a symmetrical 
form, in which no preference is given to any one of the 
variables. 

As a very simple case, which is nevertheless typical, we con- 
sider the following problem: to find the stationary values of a 
function {(x, y) when the two variables x, y ave not mutually inde- 
pendent, but are connected by a subsidiary condition 


$(z, y) =0. 


In order to give geometrical plausibility to the analytical treat- 
ment, we assume first that the subsidiary condition is represented, 
as in fig. 30, by a curve in the zy-plane without singularities and 
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that in addition the family of curves f(x, y) = ¢ = const. covers 
a portion of the plane, as in the figure. The problem is then 
as follows: among the curves of the family which intersect 
the curve ¢ = 0, to find that one for which the constant ¢ is the 


a. Fao 


9-0 


Fig. 30.—Extreme value of f with subsidiary condition ¢ = 0 


greatest possible or the least possible. As we describe the 
curve ¢ = 0 we cross the curves f(x, y) =, and in general c 
changes monotonically; at the point where the sense in which 
we run through the c-scale is reversed we may expect an ex- 
treme value. From fig. 30 we see that this occurs for the curve 
of the family which touches the curve ¢=0. The co-ordinates 
of the point of contact will be the required values x= £, y= 7 
corresponding to the extreme value of f(a, y). If the two curves 
J = const. and ¢ = 0 touch, they have the same tangent. Thus 
at the point «= £, y = 7 the proportional relation 


Se fy = be? by 


holds; or, if we introduce the constant of proportionality A, the 
two equations 

S at Aba =0 

S, yt Ady =0 


are satisfied. These, with the equation 
$(z, y) = 9, 


serve to determine the co-ordinates (£, 7) of the point of contact 
and also the constant of proportionality 2. 
This argument may fail, e.g. when the curve ¢== 0 has a 
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singular point, say a cusp as in fig. 31, at the point (¢, 7) at 
which it meets a curve f= c with the greatest or least. possible c. 
In this case, however, we have both 


bA€, m)=0 and 4¢,(€, n) = 0. 


Tn any case we are led intuitively to the following rule, which 
we shall prove in the next sub-section: 


i) 
§ 
POS 


Fig. 31.—Extreme value at a singular point of ¢ = 0 


In order that an extreme value of the function f(x, y) may occur 
at the point x = £, y = n, with the subsidiary condition ¢(x, y) = 0, 
the point (&, n) being such that the two equations 


gS) = 90 and $,(f, 4) =0 


are not both satisfied, it is necessary that there should be a constant 
of proportionality such that the two equations 


Tats 7) + Adal €, 7) =0 and Sulé 7) + ASAE, n) =0 
are satisfied, together with the equation 
$(E, 9) = 0. 


This rule is known as Lagrange’s method of undetermined 
multipliers, and the factor A is known as Lagrange’s multiplier. 

We observe that for the determination of the quantities ¢, 7, 
and A this rule gives as many equations as there are unknowns. 
We have therefore replaced the problem of finding the positions 
of the extreme values (€, 7) by a problem in which there is an 
additional unknown A, but in which we have the advantage of 
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complete symmetry. Lagrange’s rule is usually expressed as 
follows: 

To find the extreme values of the function f(x, y) subject to the 
subsidiary condition ¢(x, y) = 0, we add to f(x, y) the product of 
A(x, y) and an unknown factor d independent of x and y, and write 
down the known necessary conditions, 


fet Aga = 0, Sut Ad, = 9, 


for an extreme value of F = {+ Ad. In conjunction with the sub- 
sidiary condition ¢ = 0 these serve to determine the co-ordinates 
of the extreme value and the constant of proportionality A. 


Before proceeding to prove the rule of undetermined multipliers 
rigorously we shall illustrate its use by means of a simple example. We 
wish to find the extreme values of the function 


us zy 
on the circle with unit radius and centre the origin, that is, with the sub- 
sidiary condition 
et+y—l=0. 
According to our rule, by differentiating ay -+- A(z? -+- y? — 1) with respect 
to a and to y we find that at the stationary points the two equations 
y+ 2227=0 
x+ 2y=0 
have to be satisfied. In addition we have the subsidiary condition 
a+y—1l=0. 


On solving we obtain the four points 


E= 44/2, qn=h 9 
=—-37/2, n= —-3V2 

b= 3/2, q=— /2, 

B=-$V2 0 = 32 


The first two of these give a maximum value v= 4, the second two a mini- 
mum value u = —4, of the function u = zy. That the first two do really 
give the greatest value and the second two the least value of the function u 
can be seen as follows: on the circumference the function must assume a 
greatest and a least value (cf. p. 97), and since the circumference has no 
boundary point, these points of greatest and least value must be stationary 
points for the function. 
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4. Proof of the Method of Undetermined Multipliers in the 
Simplest Case. 


As we should expect, we arrive at an analytical proof of the 
method of undetermined multipliers by reducing it to the known 
case of “free” extreme values. We assume that at the extreme 
point the two partial derivatives ¢,(£, 7) and $,( 4) do not 
both vanish; to be specific, we assume that $,(€ 4) = 0. Then 
by section 1, No. 3 (p. 114), in a neighbourhood of this point 
the equation (x, y) = 0 determines y uniquely as a continuously 
differentiable function of 2, y= g(x). If we substitute this ex- 
pression in f(x, y), the function 


f(x, 9(@)) 
must have a free extreme value at the point z = ¢. For this the 
equation 
f'(2) = fat fig (2) = 0 
must hold at x=. In addition, the implicitly defined func- 
tion y= 9(x) satisfies the relation ¢,-+ $,9'(t) = 0 identically. 


If we multiply this equation by A= —/,/¢, and add it to 
Se + fyg'(z) = 0, then we obtain 


Sf et Arba = 0, 
and by the definition of A the equation 
S; at Ady = 0 


holds. This establishes the method of undetermined multi- 
pliers. 

This proof brings out the importance of the assumption that the deri- 
vatives ¢, and ¢, do not both vanish.at the point (&, y). If both these 


derivatives vanish the rule breaks down, as is shown analytically by the 
following example. We wish to make the function 


fmysety 
@ minimum, subject to the condition 
P(x, y) = («7 — 18 — y= 0. 


By fig. 32, the shortest distance from the origin to the curve (e— 1)*—~ y= 0 
is obviously given by the line joining the origin to the cusp S of the curve 
(we can easily prove that the circle with unit radius and centre the origin has 
no other point in common with the curve). The co-ordinates of S, that is, 
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xz== 1 and y= 0, satisfy the equations (x, y) = 0 and f, + A, = 0, no 
matter what value is assigned to A, but 


fe + Ady = 2a + 3X2 — 1)? = 2+ 0. 


We can state the proof of the y 
method of undetermined mul- 
tipliers in a slightly different 
way, which is particularly con- 
venient for generalization. We 
have seen that the vanishing of 
the differential of a function at a 
given point is a necessary con- 
dition for the occurrence of an 
extreme value of the function at 
that point. For the present 
problem we can also make the 
following statement: 

In order that the function 
f(x, y) may have an extreme value Fig. 32.—The surface (x — 1)*— »* =0 
at the point (£, ), subject to the 
subsidiary condition $({x, y) = 0, tt is necessary that the differential 
df shall vanish at that point, it being assumed that the differentials 
dx and dy are not independent of one another, but are chosen in 
accordance with the equation 


dp = pda + pydy = 0 


deduced from 6=0. Thus at the point (£, 7) the differentials 
dx and dy must satisfy the equation 


df = flé, n)dx + f(g, n)dy = 0 


whenever they satisfy the equation df = 0. If we multiply the 
first of these equations by a number A, undetermined in the first 
instance, and add it to the second, we obtain 


(fe+ Ada) da +- (fy + Ady) dy = 0. 
If we determine A so that 
S, v + Ady = 0, 


as is possible in virtue of the assumption that ¢, + 0, it neces- 
8 (2912) 
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sarily follows that (f, + Ad,)dv = 0, and since the differential 
dz can be chosen arbitrarily, e.g. equal to 1, we have 


fat Moe = 0. 
5. Generalization of the Method of Undetermined Multipliers. 


We can extend the method of undetermined multipliers to 
a greater number of variables and also to a greater number of 
subsidiary conditions. We shall consider a special case which 
includes every essential feature. We seek the extreme values of 
the function 
u= f(x, y, % 4), 


when the four variables 2, y, z, ¢ satisfy the two subsidiary 
diti 
conditions A(z, y, 2 t)= 0, P(x, y, 2, t) = 0. 


We assume that at the point (¢, 7, ¢, 7) the function takes a 
value which is an extreme value when compared with the values 
at all neighbouring points satisfying the subsidiary conditions. 
We assume further that in the neighbourhood of the point 
P(é, , &, 7) two of the variables, say z and t, can be represented 
as functions of the other two, z and y, by means of the equations 


d(x, y, 2, t)= 0 and P(e, y, 2, t)=0 


In fact, to ensure that such solutions z= g(x, y) and t= h(a, y) 
can be found, we assume that at the point P the Jacobian 


Op, ) _ 
A(z, t) = > by — ft Ps 


is not zero (cf. p. 153). If we now substitute the functions 


z= g(x, y) and t= h(z, y) 


in the function u = f(z, y, z, t), then f(x, y, 2, t) becomes a function 
of the two independent variables x and y, and this function 
must have a free extreme value at the point z= £, y= 7; that is, 
its two partial derivatives must vanish at that pot. The two 
equations 


forte wt et fea, 


fithsths 
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must therefore hold. In order to calculate from the subsidiary 
Oz Ot at 


az 
conditions the four derivatives occurring here, we 


dx’ dy Ou’ dy 
could write down the two pairs of equations 


dz a 
it dad 


bet te 2 Ft = 0 


and 
dy + bo Say of - 
by + 2 a be 5 
oz ot er P 
and solve them for the unknowns —, ..., —, which is possible 
ay), oy 


because the Jacobian does not vanish. The problem weuld 


then be solved. aed 

Instead, we prefer to retain formal symmetry and clarity by 
proceeding as follows. We determine two numbers A and p in 
such a way that the two equations 


Sat Adz + pi, = 0, 
Sit Ad, + pap, = 0 


are satisfied at the point where the extreme value occurs. The. 
determination of these “ multipliers” A and yu is possible, since 


we have assumed that the Jacobian ag, HS 
multiply the equations (@ t) 


dz ot dz ot 
gat Peat Ox = 9 and =p, + Yas + Hie 0 


is not zero. If we 


by A and yp respectively and add them to the equation 
Se ine "52 e+ he 


we have 


fat Met be + Sat dba + whe) 2 + (fet MBs + aah) 2 = 0. 
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Hence by the definition of A and p 


Sa + Abe + pie = 0. 
Similarly, if we multiply the equations 


oz ot 
dy t+ beat M8 
and 


ty + be at “ue 
by A and pu respectively and add them to the equation 
Si v + Re oar) "Es hs ay 0, 


we obtain the further es 
Su t+ Ady + phy = 0. 


We thus arrive at the following result: 
If the point (€, n, ¢, 7) 1s an extreme point of f(x, y, z, t) subject 
to the subsidiary conditions 
d(x, y, 2, t) = 0, 
P(x, Y; %, t) = 0, 


and tf at that point sees as not zero, then two numbers A and ps 


exist such that at the point (£, , C, 7) the equations 


Sat Ade ph, = 0, 
Fut Ady + Mipy = 0, 
Sat Ad, + ph, = 0, 
Set Ade + wih, = 9, 


and also the subsidiary conditions, are satisfied. 

These last conditions are perfectly symmetrical. Every trace 
of emphasis on the two variables x and y has disappeared from 
them, and we should equally well have obtained them if, instead 


of assuming that — ae * = 0, we had merely assumed that any 


re p) Hd, p) OF #) aia not 


one of the eet A(a, yy’ Mea Be a) 0) 
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vanish, so that in the neighbourhood of the point in question 
a certain pair of the quantities a, y, 2, ¢ (although possibly not 
z and ¢) could be expressed in terms of the other pair. For 
this symmetry of our equations we have of course paid the 
price; in addition to the unknowns &, 7, ¢, 7 we now have A 
and p also. Thus instead of four unknowns we now have six, 
determined by the six equations above. 

Here too we could have carried out the proof somewhat 
more elegantly by using the differential notation. In this notation, 
the necessary condition for the occurrence of an extreme value at 
the point P is the equation 


df = 0, 


where the differentials dz and dt are to be expressed in terms of 
dz and dy. These differentials are connected by the relations 


db = $,da + pydy + $42 + grdt = 0, 
dy = fyda + pb, dy + ys, dz + pdt = 0, 


obtained by differentiating the subsidiary conditions. If we 
assume that the two-rowed determinants occurring here do not 
all vanish at the point (é, 7, ¢, 7), e.g. if we assume that the 
expression 8 is not zero, then we can determine two numbers 
A and p which satisfy the two equations 

D ihe rds + pip, = 0, 

Sit Abit pile = 0. 
If we multiply the equation dé = 0 by A and the equation dys = 0 
by » and add them to the equation df = 0, then by the last two 
equations we obtain 


d(f + Ad + pap) = (fa + Ade + pip,) dx + (f+ Ady + pap,) dy. 


Since here dz and dy are independent differentials (that is, 
arbitrary numbers), it follows that the numbers A and yu also 
satisfy the equations 


fut Ady + pp, = 9, 


and we are once again led to the method of undetermined 
multipliers. 
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In exactly the same way we can state and prove the method 
of undetermined multipliers for an arbitrary number of variables 
and an arbitrary number of subsidiary conditions. The general 
Tule is as follows: 

Lf in a function 

U = f(a, La, ... 5 Vn) 


the n variables X,, Xz, ... , X, are not all independent, but are 
connected by the m subsidiary conditions (m <n) 


(4, Ly, wang Ln) = 0, 
$o(Xy, La, .- +5 Ly) = 0, 


PmlX1, Dea, --+ Ln) = 0, 
then we introduce m multipliers d,, rg... Ag, and equate the 
derivatives of the function 


F= f+ Ady + Asbo +--+ + Amb m 


with respect to X,, Xg,..., X,, when ry, ry, ... , Aq are constant, 
to zero. The equations 


io | a ee 
Oat, 


thus obtained, together with the m subsidiary conditions 
¢; = 0, sey dm = 9, 


represent a system of m-+n equations for the m+n unknown 
quantities X,, Xq,..-+, Xg, Ay, ~~. 5 Ag These equations must be 
satisfied at every extreme value of {, unless at that extreme value 
every one of the Jacobians of the m functions ¢,, bo, -.- , bm with 
respect to m of the variables x,,..., X, has the value zero. 

In connexion with the method of undetermined multipliers 
we have still to make the following important remark. The rule 
gives us an elegant formal method for determining the points 
where extreme values occur, but it merely gives us a necessary 
condition. The further question arises whether and when the 
points which we find by means of the multiplier method do 
actually give us a maximum or a minimum of the function. 
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Into this question we shall not enter; its discussion would lead 
us much too far afield. As in the case of free extreme values, 
when we apply the method of undetermined multipliers we usually 
know beforehand that an extreme value does exist. If, then, 
the method determines the point P uniquely and the exceptional 
case (all the Jacobians zero) does not occur anywhere in the 
region under discussion, we can be sure that we have really 
found the point where the extreme value occurs. 


6. Examples. 


1. As a first example we attempt to find the maximum of the function 
f(x, y, 2) = ay°24 subject to the subsidiary condition a? + y? + 2? = ec, 
On the spherical surface 2? + y? + 22 = c? the function must assume a 
greatest value, and since the spherical surface has no boundary points 
this greatest value must be a maximum in the sense defined above. 
According to the rule we form the expression 


P= a%y't + Nat + y+ 8 — 8), 
and by differentiation obtain 
Qry2? + 22 = 0, 


QxPy22 + 2ray = 0, 
2aty?z + 2az = 0. 


The solutions with x= 0, y = 0, or 2 = 0 can be excluded, for at these 
points the function f takes on its least value, zero. The other solutions 
of the equation are a? = y* = 2?, } = —x*. Using the subsidiary condition, 
we obtain the values 

c 


Saf nes ei yO), sh yell 
eat 7s Ye *- =i 


for the required co-ordinates. 

At all these points the function assumes the same value c*/27, which 
is accordingly the required maximum value. Hence any triad of numbers 
satisfies the relation 

2 2 3 
Vax oP tv tt 


that is, the geometric mean of three positive numbers 2%, y, z* is never 
greater than their arithmetic mean. 

In fact, it is true that for any arbitrary number of positive numbers 
the geometric mean never exceeds the arithmetic mean. The proof is 
similar to that just given.* 


2. As a second example we shall seek to find the triangle (with sides 


‘ 


* For another proof, see Vol. I, Ex. 19, p. 167. 
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%, y, 2) with given perimeter 2s, and the greatest possible area. By a well- 
known formula the square of the area is given by 


f(%, y 2) = 8(8 — 2)(8 — y)(8 — 2). 
We have therefore to find the maximum of this function subject to the 
subsidiary condition 
ep=aet+y+2—%=—90, 


where x, y, 2 are restricted by the inequalities 
220, y20, 220, e+y2%, e4+22y, y+22u. 


On the boundary of this closed region, i.e. whenever one of these in- 
equalities becomes an equation, we always have f= 0. Consequently the 
greatest value of f occurs in the interior and is a maximum. We form 
the function 


F(x, y, 2) = a(s — x)(s — y)(8 — 2) + Met y +2 — 28), 
and by differentiation obtain the three conditions 
—3(8— y)(8—z)+A=0, —s(s — x)(s—z)+21=9, 
—s(s—z)(s—y)+A=0. 
By solving each of these for 1 and equating the three resulting expressions 
we obtain «= y= z= 2s/3; that is, the solution is an equilateral 
triangle. 
3. We shall now prove the following theorem: the inequality 


ww stat lis 
% B 


holds for every u20, v0 and every «>0, B>O for which 


1 1 
ae 5 1. 

The inequality is certainly valid if either u or v vanishes. We may 
therefore restrict ourselves to values of wu and v such that uv + 0. If the 
inequality holds for a pair of numbers 4, v, it also holds for all numbers 
utllt, yfl/8, where ¢ is an arbitrary positive number. We need therefore 
consider only values of u, v for which wy = 1. Hence we have to show 
that the inequality 


Lye i wes 
% B 


holds for all positive numbers u, v such that wv = 1. 


To do this we solve the problem of finding the minimum of Lie + 5 oF 
a 


subject to the subsidiary condition w= 1. This minimum obviously 
exists and occurs at a point (uv, v) where u++0,v+0. A multiplier —2 for 
which the equations 


wos du=0, A1—wW=—0 
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hold therefore exists. On multiplication by u and v respectively these at 

once yield w*= 2, v?a= A, Taken with w=1, these imply that 

w= = 1. ‘The minimum value of the function = ut 5 of is therefore 
a 


1+ ; = 1. That is, the statement that 
Liss wot 
oe 8 

when uv = 1 is proved. 


If in the inequality uo < be Ue +f 5” just proved we replace u and » by 
« 


U; v; 
+ and v= £ 


% a= —__* 5 
(Sug) (Svg)ue 
tm? tm 


respectively, where 1, tgs. » - 5 Uns Vip Vay - - +» Vy ae arbitrary non-negative 
numbers and at least one u and at least one v is not zero, and if we then 
sum the inequalities thus obtained for i= 1,..., , we obtain Hélder’s 
inequality 

n n n 

Duy, S (Dupgy'le( TD vf)Ue, 

=} t= t= 
This holds for any 2n numbers w,, v; where u, = 0, v, 2 0(¢ = 1, 2,...,m), 
not all the w’s and not all the v’s are zero, and the indices a, 6 are such 


that >0,8>0,24121, 
a 6 
4. Finally, we seek to find the point on the closed surface 
9(z, y, 2) = 0 


which is at the least distance from the fixed point (&, y, ¢). If the 
distance is a minimum its square is also a minimum; we accordingly 
consider the function 


F(x, y, 2) = (@ — EP + (y — ny + (2 — SP + Aa, y 2). 
Differentiation gives the conditions 
Wa— E)+ Ap, = 0, Ay— xy) + Ap, = 0, Wz— %)+ Ag, = 0, 


or, in another form, 


a 7 Py 7 9s 
These equations state that the fixed point (&, », ¢) lies on the normal to 
the surface at the point of extreme distance (x, y, z). Therefore in order 
to travel along the shortest path from a point to a (differentiable) surface, 


we must travel in a direction normal to the surface. Of course further 
8* (n912) 
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discussion is required to decide whether we have found a maximum or a 
minimum or neither. (Consider, e.g., a point within a spherical surface. 
The points of extreme distance lie at the ends of the diameter through 
the point; the distance to one of these points is a minimum, to the other 
@ maximum.) 


EXAMPLES 


1. Find the greatest and least distances of a point on the ellipse 
2 
ar? 

from the straight line z+ y—4= 0. 


2. The sum of the lengths of the twelve edges of a rectangular block is 
a; the sum of the areas of the six faces is a#/25. Calculate the lengths of 
the edges when the excess of the volume of the block over that of a cube 
whose edge is equal to the least edge of the block is greatest. 


3. Determine the maxima and minima of the function 
(az? + by2je#*-v® (0 <a <b). 
4, Show that the maximum value of the expression 


ax® + 2bay + cy* 
ea* + 2fay + gy? 


is equal to the greater of the roots of the equation in A 
(ac — 8) — A(ag — 2f + ec) + Weg — f*) = 0. 


(ea — f? > 0) 


5. Calculate the maximum values of the following expressions: 


w + Gay + 3y? at + 2a8y 
a— ay +e’ at + yf 


6. Determine the stationary points of the function 


te, 9) = y*(sine — 5) 


(2) (b) 


and state their nature. 
7*, Find the values of a and b for the ellipse 


of least area containing the circle 
@—lt+y=1 
in its interior. 
8. Find the quadrilateral with given edges a, b, c, d which includes the 
greatest area. 


Ily] MAXIMA AND MINIMA 203 
9. Which point of the sphere 2? 4- 2 + 2? = 1 is at the greatest dis- 
tance from the point (1, 2, 3)? 


10. Let P,P,P,P, be a convex quadrilateral. Find the point O for 
which the sum of the distances from P,, P,, P;, P, is a minimum. 


11. Find the point (a, y, z) of the ellipsoid 


for which 
(7) A+ B+4, 
(b) V(A? + Bt+ C*) 


is a minimum, where A, B, C denote the intercepts which the tangent 
plane at (x, y, z) (x7 > 0, y > 0, 2 > 0) makes on the co-ordinate axes. 
12. Find the rectangular parallelepiped of greatest volume inscribed 
in the ellipsoid 
2a yy 
atatan! 


13, Find the rectangle of greatest perimeter inscribed in the ellipse 
e 
oe! 
14, Find the point of the ellipse 
5a? — Gay + Sy? = 4 
for which the tangent is at the greatest distance from the origin. 
15*. Prove that the length / of the greatest axis of the ellipsoid 
ax? + by® 4- cz* + Bday + Benz + yz = 1 
is given by the greatest real root of the equation 


a p d ée 
1 
d b— 5, f = 0, 
1 
@ f c— 
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Appendix to Chapter III 
1. Surrictenr Conprrions ror ExTREME VALUES 


Tn the theory of maxima and minima in the preceding chapter 
we have contented ourselves with finding necessary conditions 
for the occurrence of an extreme value. In many cases occurring 
in actual practice the nature of the “ stationary ” point thus 
found can be determined from the special nature of the problem, 
and we can thus decide whether it is a maximum or a minimum. 
Yet it is important to have general sufficient conditions for the 
occurrence of an extreme value. Such criteria will be developed 
here for the typical case of two independent variables. 

If we consider a point (2, yo) at which the function is 
stationary, that is, a point at which both first partial derivatives 
of the function vanish, the occurrence of an extreme value 
is connected with the question whether the expression 


F(a + 4, Yo + k) —f (Xo. Yo) 


has or has not the same sign for all sufficiently small values of 
h and k. If we expand this expression by Taylor’s theorem 
(Chap. II, p. 80), with the remainder of the third order, in virtue 
of the equations f,(%9, Yo) = 0 and f(x, ¥9) = 0 we at once obtain 


So +h, Yo k) —F Gor Yo) = 2 Gen + Ahhifay + Mf) + €p*, 


where p* = h? + k* and ¢ tends to zero with p. 

From this we see that in a sufficiently small neighbourhood 
of the point (%», yo) the behaviour of the functional difference 
S(@o +h, Yo tk) —f(X, Yo) is essentially determined by the 
expression 


Qh, k) = ah? +- 2bhk + ck’, 
where for brevity we have put 
a= fexlZo Yo) 0=fev(%o Yo), ©=Syu(®o Yo) 


In order to study the problem of extreme values we must 
investigate this homogeneous quadratic expression in h and k, 
or, a8 we say, the quadratic form @. We assume that the 
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coefficients a, 6, c do not all vanish. In the exceptional case 
where they do all vanish, which we shall not consider, we must 
begin with a Taylor series extending to terms of higher order. 

With regard to the quadratic form Q there are three different 
possible cases: 

1. The form is definite. That is, when h and & assume all 
values, Q assumes values of one sign only, and vanishes only 
for h== 0, k= 0. We say that the form is positively definite or 
negatively definite according as this sign is positive or negative. 
For example, the expression h? + k?, which we obtain when 
a=c=1, b=0, is positively definite, while the expression 
—h® + Qhk — 2k® = —(h — kh)? — I? is negatively definite. 

2. The form is indefinite. That is, i+ can assume values of 
different sign, e.g. the form @ = 2hk, which has the value 2 for 
h==1, k= 1 and the value —2 for h= —1, k= 1. 

3. Finally, there is still a third possibility, namely that in 
which the form vanishes for values of h, & other than h = 0, 
k= 0, but otherwise assumes values of one sign only, e.g. the 
form (h -+ k)?, which vanishes for all sets of values h, & such 
that h = —k. Such forms are called semi-definite. 

The quadratic form Q = ah® + 2bhk + ck* is definite if, and 
only if, the condition Pree er 
is satisfied; it is then positively definite if a > 0 (so that c > 0 
also), otherwise it is negatively definite. 

In order that the form may be indefinite it is necessary and 
sufficient that 
ac— <0, 


while the semi-definite case is characterized by the equation * 
ac — 6? = 0. 


* These conditions are easily obtained as follows. Either a=e=0, in 
which case we must have 6 + 0, and the form is, as already remarked, indefinite; 
the criterion therefore holds for this case: or else we must have, say, a + 0; 
we can then write 


2 — p2 

ah? + 2bhk + ck® = af (a + 2k) +a5% |. 
This form is obviously definite if ca ~ 6% > 0, and it then has the same sign 
asa. It is semi-definite if ca — 6? = 0, for then it vanishes for all values of 


h, k that satisfy the equation h/k = ~6/a, but for all other values it has the 
same sign. It is indefinite if ca — b* < 0, for it then assumes values of different 


sign when & vanishes and when h + rs k vanishes. 


206 DEVELOPMENTS AND APPLICATIONS  [Cuap. 


We shall now prove the following statements. If the quadratic 
form Q(h, k) is positively definite, the stationary value assumed 
for h = 0, k = 0is a minimum. If the form is negatively definite, 
the stationary value is a maximum. If the form is indefinite, we 
have neither a maximum nor a minimum; the point is a saddle 
point. Thus, definite character of the form Q is a sufficient con- 
dition for an extreme value, while indefinite character of Q 
excludes the possibility of an extreme value. We shall not 
consider the semi-definite case, which leads to involved dis- 
cussions. 

In order to prove the first statement we have only to use 
the fact that if Q is a positively definite form there is a positive 
number m, independent of h and k, such that* 


Q = Am(h? + k*) = 2mp?. 
Therefore 


(> +h, yo +k) — fo; Yo) = 39h, k) + ep? = (m+ e)p*. 


If we now choose p so small that the number « is less in absolute 
value than 4m, we obviously have 


Fay th, yo tk) — f(%e Yo) = 5 ee 


Thus for this neighbourhood of the point (xp, yo) the value of the 
function is everywhere greater than f(x», yo), except of course at 
(%, Yo) itself. In the same way, when the form is negatively 
definite the point is a maximum. 

Finally, if the form is indefinite, there is a pair of values 
(h,, k,) for which @ is negative and another pair (h,, k,) for which 
Q is positive. We can therefore find a positive number m such 


that 
Oh; k)< —2mp,, 
Q(he, ky) > 2mp,*. 
If we now put h= th, k= tk, p?=h? + i (t+ 0), that is, if 
ae = as a function of the two 
Then u? + v? = J, and the form 


*To see this we consider the quotient 


ais h 

quantities u VUF +B) and v VUF+By 
becomes a continuous function of wand v, which must have a least value 2m on 
the circle u® + v? = 1. This value m obviously satisfies our conditions; it is 
not zero, for on the circle u and v never vanish simultaneously. 
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we consider a point (x) + h, yo + k) on the line joining (29, Yo) 
to (Zo + Ay, Yo + &), then from Q(h, &) = PQ(h,, ky) and p? = #p,? 
we have 

Q(h, k) < —2mp?. 


Thus by choice of a sufficiently small t (and corresponding p) 
we can make the expression f(% + h, ¥y + &) —f (Xp, Yo) negative. 
We need only choose ¢ so small that for h= th, k= tk, the 
absolute value of the quantity ¢ is less than 4m. For such a set 
of values we have f(%)-+ h, Yyp +.2—f (Xo, Yo) <—mp?/2, 80 that 
the value f(z) -+ h, yo + &) is less than the stationary value 
(0: Yo). In the same way, on carrying out the corresponding 
process for the system h = tha, k = tka, we find that in an arbi- 
trarily small neighbourhood of (2%, yp) there are points at which 
the value of the function is greater than f(z 9, yp). Thus we have 
neither a maximum nor a minimum, but instead what we may 
call a saddle value. 

If a= b=c= 0 at the stationary point, so that the quad- 
ratic form vanishes identically, and also in the semi-definite case, 
this discussion fails to apply. To obtain sufficient conditions for 
these cases would lead to involved calculations. 

Thus we have the following rule for distinguishing maxima 
and minima: 

If at a point (Xo, Yo) the equations 


Fel®, Yo) = 9, fr(Xo, Yo) = 0 
hold, and also the inequality 
Seat vy — fey? > 0, 


then at that point the function has an extreme value. This is a 
maximum if f,, <0 (and consequently f,, <0), and a minimum 
of £,, > 0. 

Lf, on the other hand, 


SeaSvy — fa? <9, 


the stationary value is neither a maximum nor a minimum. The 


case 
f. wad: vw f. ay — 0 
remains undecided. 
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These conditions permit of a simple geometrical interpretation. 
The necessary conditions f,=f,== 0 state that the tangent 
plane to the surface z = f(z, y) is horizontal. If we really have an 
extreme value, then in the neighbourhood of the point in question 
the tangent plane does not intersect the surface. In the case 
of a saddle point, on the contrary, the plane cuts the surface in 
a curve which has several branches at the point. This matter 
will be clearer after the discussion of singular points in the next 
section. 


As an example we seek to find the extreme values of the function 
Se 9) = P+ ay + y? + ax + by. 
If we equate the first derivatives to zero, we obtain the equations 
2a+yta=0, «+ %+b=0, 
which have the solution «= 4(b — 2a), y= 4(a — 2b). The expression 
Fecafuu — Seu? = 3 


is positive, as is f,,.== 2. The function therefore has a minimum at the 
point in question. 
The function 
f(y) = (y — #P + 28 


has a stationary point at the origin. There the expression fix fyy — fry” 
vanishes, and our criterion fails. We readily see, however, that the function 
has not an extreme value there, for in the neighbourhood of the origin 
the function assumes both positive and negative values. 

On the other hand, the function 


f(x, y) = (@ — 9) + (y— If 


has a minimum at the point «= 1, y= 1, though the expression 
Sead vy — Sey? Vanishes there. For 


f+ h, 1+ k)— fi, I= (h—- b+ B, 
and this quantity is positive when o + 0. 


ExaMPLre 
If f(a) = k + 0, 6/(a) + 0, and 2, y, z satisfy the relation 
P(x) Ply) H(z) = &, 


prove that the function 


F(@) + fy) + f@ 
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has a maximum when z = y = z = a, provided that 


un (P (a) _ $a) a 
f(@) (a) ca) > f(a). 


2. Smncutar Ports or PLANE CURVES 


In Chap. III, section 2 (p. 128) we saw that a curve f(x, y) = 0 
in general has a singular point at a point = 2%», y = Yo such 
that the three equations 


og (Zo, Yo) = 9, I (Xo, Yo) = 0, Si (Zo, Yo) =0 


hold. In order to study these singular points systematically, we 
assume that in the neighbourhood of the point in question the 
function f(z, y) has continuous derivatives up to the second 
order, and that at that point the second derivatives do 
not all vanish. By expanding in a Taylor series up to terms 
of the second order we obtain the equation of the curve in 
the form 


Of (a, y) = (& — %o)*Fae(@o, Yo) + 2(@ — 2) (Y — Yo) fev(For Yo) 
+ (y¥ — Yo)*fuvlZo» Yo) + <p? = 9, 


where we have put p? = (x — 2 )*-+ (y — yo)* and ¢ tends to 
zero with p. 

Using a parameter ¢, we can write the equation of the general 
straight line through the point (a9, y9) in the form 


L—t=—a, y—yo= Ut, 


where a and 6 are two arbitrary constants, which we may suppose 
to be so chosen that a?-++ b?=1. To determine the point of 
intersection of this line with the curve f(x, y) = 0 we substitute 
these expressions in the above expansion for f(z, y); for the 
point of intersection we thus obtain the equation 


O° fog + Qabt fy + D2 fyy + ef? = 0. 


A first solution is t= 0, ie. the point (x, Yo) itself, as is 
obvious. It is, however, worthy of notice that the left-hand 
side of the equation is divisible by ¢, so that ¢ is a “ double root ” 
of the equation. For this reason the singular points are also 
sometimes called “ double points ” of the curve. 
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If we remove the factor é, we are left with the equation 
at i 2abfay + Phy +e=0. 


We now inquire whether it is possible for the line to intersect 
the curve in another point which tends to (xy, yp) as the line 
tends to some particular limiting position. Such a limiting 
position of a secant we of course call a tangent. To discuss this, 
we observe that as a point tends to (xz, yo) the quantity ¢ 
tends to zero, and therefore ¢ also tends to zero. If the equation 
above is still to be satisfied, the expression a2f,, +- 2abfn, + fy, 
must also tend to zero; that is, for the limiting position of the 
line we must have 


O faa + Zabfay + Defy = 0. 


This equation gives us a quadratic condition determining the 
ratio a/b which fixes the line. 
If the discriminant of the equation is negative, that is, if 


Janko — Sev? <0, 


we obtain two distinct real tangents. The curve has a double point or 
node, like that exhibited by the lemniscate (22+ y)?— (x?— y®)= 0 
at the origin or the strophoid (2? + y?)(#— 2a)+ aa = 0 at 
the point 2) = a, y, = 0. 

If the discriminant vanishes, that is, if 


Feafyy — far? = 0, 


we obtain two coincident tangents; it is then possible e.g. that 
two branches of the curve touch one another, or that the curve 
has a cusp. 
Finally, if 
7 Seefuy — fev? > 9, 
there is no (real) tangent at all. This occurs e.g. in the case of the 
so-called isolated points or conjugate points of an algebraic curve. 
These are points at which the equation of the curve is satisfied, 
but in whose neighbourhood no other point of the curve lies. 


The curve (2* — a3)? + (y? — 6%)? = a*-+ b¢ exemplifies this. The 


values z = 0, y = 0 satisfy the equation, but for all other values in the 
region | z | < av 2, | y |< V2 the left-hand side is less than the right. 


We have omitted the case in which all the derivatives of the 
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second order vanish. This case leads to involved investigations, 
and we shall not consider it. Through such a point several 
branches of the curve may pass, or singularities of other types 
may occur. 

Finally, we shall briefly mention the connexion between 
these matters and the theory of maxima and minima. Owing to 
the vanishing of the first derivatives, the equation of the tangent 
plane to the surface z= f(z, y) at a stationary point (x9, Yo) is 
simply 

z—f (Xp, Yo) = 9. 


J (x, y) —f(%o, Yo) = 9 


therefore gives us the projection on the zy-plane of the curve of 
intersection of the tangent plane with the surface, and we see 
that the point (x9, yo) is a singular point of this curve. If this is 
an isolated point, in a certain neighbourhood the tangent plane 
has no other point in common with the surface, and the function 
f(x, y) has a maximum or a minimum at the point (x, Yo) (cf. 
p. 208). If, however, the singular point is a multiple point, the 
tangent plane cuts the surface in a curve with two branches, and 
the point corresponds to a saddle value. These remarks lead us 
precisely to the sufficient conditions which we have already 
found in section 1 (p. 207). 


The equation 


3. SmneuLAR Points of SuRFACES 


Tn a similar way we can discuss a singular point of a surface 
f(@, y, 2) = 0, Le. a point for which 


f= 0, fe=fp=fie= 0. 


Without loss of generality we may take the point as the origin O. 
If we write 


Sow = 0; fov = By fos = Vs fou = As Sus = Mo Sus = ¥ 
for the values at this point, we obtain the equation 
ax? + By® + y2? + 2rxy + 2unyz + 2vaz= 0 


for a point (x, y, 2) which lies on a tangent to the surface at O. 
This equation represents a quadratic cone touching the 
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surface at the singular point—instead of the tangent plane at an 
ordinary point of the surface—if we assume that not all of the 
quantities a, 8, ..., v vanish and that the above equation has 
teal solutions other than «= y= z= 0. 


4. CONNECTION BETWEEN EULER’S AND LAGRANGE’S 
REPRESENTATIONS OF THE MoTION oF A FLUID 


Let (a, b, c) be the co-ordinates of a particle at the time 
t= 0 in a moving continuum (liquid or gas). Then the motion 
can be represented by three functions 


= x(a, b, ¢, t), 


y= y(a, b, ¢, t), 
z= 2(a, b, ¢, t), 


or in terms of a position vector x = x(a, b, ¢, t). Velocity and 
acceleration are given by the derivatives with respect to the 
time t. Thus the velocity vector is # with components @, ¥, Z, 
and the acceleration vector is % with components #, #, 2, all of 
which appear as functions of the initial position (a, b, c) and the 
parameter ¢t. For each value of t we have a transformation of 
the co-ordinates (a, b, c) belonging to the different points of the 
moving continuum into the co-ordinates (x, y, z) at the time t. 
This is the so-called Lagrange representation of the motion. 
Another representation introduced by Euler is based upon the 
knowledge of three functions 


u(x, y, 2, t), v(x, y, 2 t), w(x, Y, %, t) 


representing the components &, ¥, 2 of the velocity % of the motion 
at the point (2, y, z) at the time ¢. 

In order to pass from the first representation to the second 
we have to use the first representation to calculate a, b, cas 
functions of 2, y, z, and ¢, and to substitute these expressions in 
the expressions for <(a, b, c, t), y(a, 6, ¢, t), 2(a, 6, ©, t): 


u(x, y, 2, t) = &fa(z, y, 2, t), b(a, y, 2, t), oa, y, 2, t), t}, &e. 
We then get the components of the acceleration from 
a(a, b, c, t) = uf{a(a, b, ¢, t), y(a, b, ¢, t), 2(a, b, ¢, t), t}, &e. 
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as follows: 

= Ug + uyy + ug+ up, &e., 
or 

B= Ugh + Uyv + Uw + Up 

G = Vb + V0 + 0,0 + Vp 

z= w,u + wiv + ww + Wy 


In the mechanics of a continuum the following equation con- 
necting Euler’s and Lagrange’s representations is fundamental: 


div # = u,-+ vy = > 


where 
O(2, y, 2) 
O(a, 6, ¢): 


is the Jacobian characterizing the motion. 

The reader may complete the proof of this and the corre- 
sponding theorem in two dimensions by using the various rules 
for the differentiation of implicit functions. 


D(x, y, 2, t) = 


5. TanqEntiaL REPRESENTATION OF A CLOSED CURVE 


A family of straight lines with parameter a may be given by 
xcosa+ ysina—pia)=0,... . (1) 


where (a) denotes a function which is twice continuously differ- 
entiable and periodic of period 27 (a so-called tangential function). 
The envelope C of these lines is a closed curve satisfying (1) and 
the further equation 


—zsma-+ y cosa — p’(a) = 0. 
Hence 
2= p cosa — p’ =) (2) 
y=psina+p'cosa) © © * * * 


is the parametric representation of CO (a being the parameter). 
Formula (1) gives the equation of the tangents of C and is referred 
to as the tangential equation of C. 
Since 
e = —(p+ p") sina, y' = (p+ p”’) cosa, 
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we at once have the following expressions for the length Z and 
area A of C: 


L =["(p+p"\da =| "pda, 


Qr Qr Qa 
A= if (zy’ — yx’) da = if (p+ p")pda = if (p?— p”) da, 


since p’(a) is also a function of period 27.* 
From this we deduce the isoperimetric inequality 


I? > 4nA, 


where the equality sign holds for the circle only. This may also 
be expressed by the statement: among all closed curves of given 
length the circle has the greatest area. 

For the proof we make use of the Fourier expansion of p(a) 
(Vol. I, Chap. IX, p. 447), 

pa) = S + % (a, cosva + 8, sinva); 

a v=] 

then 


p'(a) = & v(b, cosva — a, sinva), 
v=1 


so that (using the orthogonality relations of Vol. I, p. 438) we have 


L = to, 
4=7(@—E08— ott 2,4). 
Thus : ‘ 
4sP-7 


2 ~ ° 
in particular, 4 = z only if a,=6b,=0 for v2, ie. 
TT 
p(a) = a+ a, cosa-+ b, sina; the latter equation defines a 
circle, as is easily proved from (2). 


* Since p(a)-+c¢ is obviously the tangential function of the parallel curve at a 
distance c from C, the formuls for the area and the length of a parallel curve 
(cf. Vol. I, p. 291, Ex. 22, and p. 553) are easily derived from these expressions. 


CHAPTER IV 
Multiple Integrals 


The idea of differentiation and the operations with derivatives 
in the case of functions of several variables are obtained almost 
immediately by reduction to their analogues for functions of one 
variable. As regards integration and its relation to differentiation, 
on the other hand, the case of several variables is more involved, 
since the concept of integral can be generalized for functions of 
several variables in a variety of ways. In this chapter we shall 
study multiple integrals such as we have already met in Vol. I, 
Chap. X (p. 486). In addition to these, however, we have also to 
consider the so-called line integrals in the plane, and surface 
integrals, as well as line integrals, in three dimensions (Chap. V, 
p. 343). In the end, however, it is found that all questions of 
integration can be reduced to the original concept of the integral 
in the case of one independent variable. 


1. Ornprinary InTEGRALS aS FUNCTIONS OF A PARAMETER 


Before we study the new situations which arise with functions 
of more than one variable, we shall discuss some concepts which 
are directly related to matters already familiar to us. 


1, Examples and Definitions. 


If f(z, y) is a continuous function of x and y in the rectangular 
regiona <a f,aSy Sb, we may in the first instance think 
of the quantity x as fixed, and we can then integrate the 
function f(z, y), which is now a function of y alone, over the 
interval a Sy <b. We thus arrive at the expression 


[ fen 


which still depends on the choice of the quantity x. In a sense, 
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therefore, we are considering not an integral but the family of 
integrals f JS (a, y)dy which we obtain for different values of a. 


This quantity, which is kept fixed during the integration and to 
which we can assign any value in its interval, we call a parameter. 
Our ordinary integral therefore appears as a function of the para- 
meter x. 

Integrals which are functions of a parameter frequently 
occur in analysis and its applications. 


Thus, as the substitution zy = w readily shows, 
uA 7 aw = arosinaz. 
o Vil — ay") 
Again, in integrating the general power function we may regard the index 
as @ parameter and write accordingly 


fre 


where we assume that x > — 1. 


If we represent the region of definition of the function f(z, y) 
geometrically, and make 
the parallel to the y-axis 
corresponding to the fixed 
value of a intersect the 
rectangle as in fig. 1, then 
we obtain the function of 
y which is to be integrated 
by considering the values of 
the function f(#, y) a8 a 
function of y along the line 
Fig. 1 of intersection AB. We 
may also speak of integrat- 

ing the function f(z, y) along the segment AB. 

This geometrical point of view suggests a generalization. If 
the region of definition R in which the function f(z, y) is con- 
sidered is not a rectangle, but instead has the shape shown in 
fig. 2 (that is, if any parallel to the y-axis cuts the boundary in 
at most two points), then for a fixed value of 2 we can again 


integrate the values of the function f(x, y) along the line 4B in 


| 
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which the parallel to the y-axis intersects the region of definition 
R. The initial and final points of the interval of integration 


Fig. 2 


will themselves vary as x varies. In other words, we have to 
consider an integral of the type 


a(x) 
{ f@ yy = Fe), 
Yox(%) 


that is, an integral with the variable of integration y and the 
parameter x, in which the parameter occurs both in the integrand 
and in the limits of integration. 


If, for example, the region of definition is a circle with unit radius and 
centre the origin, we shall have to consider integrals of the type 


+V(1—x%) 
f f(z, y)dy. 
—Vv(i—x!) 


2. Continuity and Differentiability of an Integral with respect to 
the Parameter. 


The integral 
F(a) =f fle, yay 
is a continuous function of the parameter x, if {(x, y) is continuous 


in the region im question. 
For 


F(a + h) — F(e) 


a fret h, y) — f(a, Way] 


< '|fe +h — Fe n/a. 
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In virtue of the (uniform) continuity of f(x, y), for sufficiently 
small values of h the integrand on the right, considered as a 
function of y, may be made uniformly as small as we please, 
and the statement follows immediately. In particular, therefore, 
we can integrate the function F(x) with respect to the parameter 
x between the limits a and f, obtaining 


[Fee =['([te. y) dy ) dx. 


The integral on the right we also write in the form 


[ fre. y) dy da; 


we call it a repeated integral or multiple integral (in this case a 
double integral). 

We now investigate the possibility of differentiating F(z). 
In the first place, we consider the case where the limits are 
fixed and assume that the function f(s, y) has a continuous 
partial derivative f,, throughout the closed rectangle R. It is 
natural to try to form the z-derivative of the integral in the 
following way: instead of first integrating and then differentiating 
we reverse the order of these two processes, that is, we first dif- 
ferentiate f with respect to x and then integrate with respect to y. 
As a matter of fact, the following theorem is true: 

Tf in the closed rectanglea SxS B, ay Sb the function 
f(x, y) has a continuous derivative with respect to x, we may dif- 
ferentiate the integral with respect to the parameter under the integral 
sign,* that is, fax B, 


£ Fo) = £ is F(a, y)dy = i Flv, y) dy. 


* From this we obtain a simple proof of the fact, which we have already 
proved (Chap. II, p. 56), that in the formation of the mixed derivative g,, of 
a function g(x, y) the order of differentiation can be changed, provided that 
Jey is continuous and g, exists. For if we put f(x, y) = 9,(%, y), we have 


y 
gle, y) = g(a, a) + f f(x, n)dn. 


Since f(z, y) has a continuous derivative with respect to z in the rectangle 
eS2eS fay b, it follows that 


Yq = Igl% a) + [tee n)dn, 
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Proof. If both x and « + h belong to the interval a SxS 8, 
we can write 


Foti) Feo=f fet hb ndy— [feng 


=f {ye +h, 9) — f(a, y)} dy. 


Since we have assumed that f(z, y) is differentiable, the mean 
value theorem of the differential calculus in its usual form 
gives * 


f(e+h, y) —f (a, y) = hfe + Oh, y), O<O<1. 


Moreover, since the derivative f, is assumed to be continuous in 
the closed region and therefore uniformly continuous, the absolute 
value of the difference 


f(x + 6h, y) — fla, y) 


is Jess than a positive quantity « which is independent of x and 
y and tends to zero with h. Thus 


ey ha Fe) a if Sux, y)dy 


f fel + 01, y)dy — [f.le, yay 


If we now let h tend to zero, « also tends to zero, and the 
relation 


6 
=} edy = «(b— a), 


lim Ret Nae == [. Sol, y)dy = F(a) 


h—>0 


at once follows; our statement is thus proved. 
In a similar way we can establish the continuity of the integral 
and the rule for differentiating the integral with respect to a 


and therefore 
Jue = Flt y)- 
In the same way, In, = f,(% y), and therefore Jey = Pyar 


* Here the quantity @ depends on y, and may even vary discontinu- 
ously with y. This does not matter, for by the equation f,(% + 0h, y) = 
h-\f(x + h, y) — f(z, y)) we see at once that f,(z + 0h, y) is a continuous 
function of z and y, and is therefore integrable. 
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parameter when the parameter occurs in the limits. If, for 
example, we wish to differentiate 


hax) 
F(e)=[ f(a, y)dy, 
W(x) 
we start with the expression 
F(a) = f f(x, y)dy = ©(u, », 2), 


where u = y,(x), v= ¥,(x). Here we assume that ¥,(2) and 
(7) have continuous derivatives with respect to z throughout 
the interval and that f(x, y) is continuously differentiable (cf. 
p. 62) in a region wholly enclosing the region R. By the chain 
rule we now obtain 


F@=24 0D du . o@ dv 


du dx ' Ov da’ 


If we apply the fundamental theorem of the integral calculus 
(Vol. I, p. 111), this gives the formula 


Pa) = [fees vhdy — Hafele) + $a ehh Gs le 
Thus if for F(a”) we take the function 


F(a) = f sin (xy) dy, 
we obtain 
oe) -[° y cos (wy) dy + sin (x). 


If we take 
_ ady 


F(a) =f Va— a) — wyi) = are sing, 


we obtain the relation 
1 dy 1 
F(z) = f ~——— 
=), Ja ay Va ay 
as the reader can verify directly. 
Other examples are given by the a. 


Fae) = [™ pana, 


F(x) = f f(y)dy, 
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where n is any positive integer and f(y) is a continuous function of y only 
in the interval under consideration. Since the expression arising from 
differentiation with respect to the upper limit 2 vanishes, the rule gives 
us 

Fyi(t) = Fy-l(2)- 

Since F(x) = f(x), this at once gives 

F,*) (z) = f(a). 
Therefore F,,(a) is the function whose (n + 1)-th derivative is equal to 
f(a) and which, together with its first x derivatives, vanishes when x = 0; 
it arises from F,,_,(x) by integration from 0 tox. Hence F,,(x) is the function 
which is obtained from f(x) by integrating n+ 1 times between the limits 0 
and x. This repeated integration can therefore be replaced by a single 


integration of the function “___-~- aes —* " T(y) with respect to y. 

The rules for pres an integral. with respect to a 
parameter often remain valid even when differentiation under 
the integral sign gives a function which is not continuous every- 
where. In such cases, instead of applying general criteria, it is 
more convenient to verify whether such a differentiation is per- 
missible in each special case. 


As an example we consider the elliptic integral (cf. Vol. I, p. 243) 


+1 dx 
Fe) =f Va—=) @— bas)’ (2 <1). 


The function 


1 
f(k, 2) = Vana G2 Ba) 


is discontinuous at = +1 and atx = —1, but the integral (as an improper 
integral) has a meaning. Formal differentiation with respect to the os 
meter & gives 


Pb) -{ Va— = = — Bais 


To investigate whether this equation is correct, we repeat the argument 
by which we obtained our differentiation formula. This gives 


F(k+ h)— F(k) +1 +1 (k + Oh)a® dz 
ph ast oh ad SE k+ 0h, 2)de=f 9 ——— se, 
h is date} 9) - V/(1— 22) (1— (k-+ 0h)224)2 
The difference between this expression and the integral obtained by formal 
differentiation is 
1 Vi-#2@\V(1— (b+ yee VO— Rep] 
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We must show that this integral tends to zero with h. For this purpose 
we mark off about & an interval kj < k < k, not containing the values 
+1, and we choose hf so small that &-+ 6” lies in this interval. The 
function 

k 


V(1 — ex 


is continuous in the closed region —1 Sz <1, ky Sk S ky, and is there- 
fore uniformly continuous. The difference 


k-+ Oh = k 
V(L— (e-- Ohya) 4/1 — Pat 


consequently remains below a bound « which is independent of z and k 
and which tends to zero with kh. Hence the integral A also remains less 
in absolute value than 
+1 2 
f palbicsd e= Me, 
-1 


V1 — 2 


where If is a constant independent of c. That is, the integral A tends to 
zero as h does, which is what we wished to show. 

Differentiation under the integral sign is therefore permissible in 
this case. Similar considerations lead to the required result in other 
cases. 

Improper integrals with an infinite range of integration are discussed 
in the Appendix to this chapter, § 4, p. 307. 


ExaMpiys 
1. Evaluate 


1 
Fly) = f av (y log x +1) da, 


2. Let f(z, y) be twice continuously differentiable, and let u(x, y, 2) 
be defined as follows: 
20 
ua, y, 2) = B f(% + zcosg, y + zsing)do. 
Prove that 
2 (Ugg + My — Use) — Uz = 0. 


3*, If f(z) is twice continuously differentiable and 
1 tt p=8 
ue =f fetne—x) = dy (>, 
-t 
prove that 


p—l 
t 


Ugg = Uy + thy. 
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4, The Bessel function J (x) may be defined by 


I(x) = 


ae Vaan a— = i 
Prove that 


1 
So + zJe + Jo = 0. 


5. For any non-negative integral index n the Bessel function J,(x) may 
be defined by 


Prd +1 .; 7 
= — na 
J (2) 1.308 aes cos at (1 — #)"—-* dt, 
Prove that 
1 n? 
(a) In’ tity +(1—S)s,=0 20, 
(6) Int = Ina — 2’ (n 21) 
and J, = —Jdy’. 


2. Toe Inreacrat or a Continuous FUNCTION OVER A 
REGION OF THE PLANE oR OF SPACE 


1, The Double Integral (Domain Integral) as a Volume. 


The first and most important generalization of the ordinary 
integral, like the ordinary integral itself, is suggested by geo- 
metrical intuition. Let R be a closed region of the zy-plane, 
bounded—as we assume all along—by one or more ares of curves 
with continuously turning tangents, and let z= f(z, y) be a 
function which is continuous in 2. We assume in the first instance 
that f is non-negative, and represent it by a surface in zyz-space 
vertically above the region R. We now wish to find (or, more 
precisely, to define, since we have not yet done so) the volume V 
below the surface. This has been done in detail for rectangular 
regions in Vol. I, Chap. X (p. 486), and, moreover, the case is so 
similar to that of the ordinary integral that we feel justified in 
mentioning it somewhat briefly here. The student will see at once 
that a natural way of arriving at this volume is to subdivide R 
into N sub-regions R,, R2, ..., Ry, each having boundaries that 
are sectionally smooth (p. 41), and to find the greatest value M, 
and the least value m, of f ineach region R;. The areas of the 
regions R, we denote by AR;. On each region R, as base we con- 
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struct a cylinder of altitude M;. This set of cylinders completely 
encloses the volume under the surface. Again, with each region 
R, as base we construct a cylinder of altitude m,, and hence 
with volume m;AR;; these cylinders lie completely within the 
volume under the surface. Then 


Em, AR, < V SEM, AR,. 
1 1 


These sums Xm,AR, and &M,AR, we call the lower and wpper 
sums respectively. 

If we now make our subdivision finer and finer, so that the 
number N increases beyond all bounds, while the greatest dia- 
meter of the regions FR, (that is, the greatest distance between 
two points of R,) at the same time tends to zero, we see intuitively 
(and shall later prove rigorously) that the upper and lower sums 
must approach one another more and more closely, so that the 
volume V can be regarded as the common limit of the upper and 
lower sums as N tends to o. 

We can obviously obtain the same limiting value if instead 
of m, or M, we take any number between m, and M,, e.g. f(xy y,), 
the value of the function at a point (x;, y,) in the region R,. 


2. The General Analytical Concept of the Integral. 


These concepts suggested by geometry must now be studied 
analytically and made more precise without direct reference to 
intuition. We accordingly proceed as follows. We consider a 
closed region R with area AR, and a function f(x, y) which is 
defined and continuous everywhere in R, including the boundary. 
As before, we subdivide the region by sectionally sraooth arcs * 
into N sub-regions R,, Ry, ..., Ry with areas AR,,..., AR,,. 
In R; we choose an arbitrary point (€;, »;) where the function 
has the value f, = f(&,, 7;) and we form the sum 


N 
V, = =f ; AR, 
The fundamental theorem is then as follows: 


If the number N increases beyond all bounds and at the same 


* Le. ares which are given in a suitable co-ordinate system by an equation 
y = (x), where ¢ is a continuous function whose derivative is continuous except 
for a finite number of jump discontinuities (cf. p. 41). 
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time the greatest of the diameters of the sub-regions tends to zero, 
then V,, tends to a limit V. This limit is independent of the par- 
ticular nature of the subdivision of the regions R and of the choice 
of the point (&, n;) in R, The limit V we call the (double) integral 
of the function f(x, y) over the region R: in symbols, 


fff was. 


Corollary. We obtain the same limit if we take the sum 
only over those sub-regions R, which lie entirely in the interior 
of R, that is, which have no points in common with the boundary 
of R. 

This existence theorem for the integral * of a continuous 
function must be proved in a purely analytical way. The proof, 
which is very similar to the corresponding proof for one variable, 
is given in the appendix to this chapter (p. 293). 

We shall now illustrate this concept of an integral by consider- 
ing some special subdivisions. The simplest case is that in which 
R is a rectangle asia <b, eS y Xd and the sub-regions R, 
are also rectangles, formed by subdividing the z-interval into 
nm equal parts and the y-interval into m equal parts, of lengths 


pote ea goes 
n 


The points of subdivision we call 29 = @, %, 2g, .-+ » Mn = b and 


* We can refine this theorem further in a way which is useful for many 
purposes. In the subdivision into N sub-regions it is not necessary to choose & 
value which is actually assumed by the function f(z, y) at a definite point 
(£;, 74) of the corresponding sub-region; it is sufficient to choose values which 
differ from the values of the function f(¢,7,) by quantities which tend uniformly 
to zero as the subdivision is made finer. In other words, instead of the values 
of the function f(é,, 7,) we can consider the quantities 


fi = Slép ™) + &,y 
where | €, y|< €y jim €y = 0. (The number ¢, y is therefore the difference 
He 


between the value of the function at a point of the i-th sub-region of the sub- 
division into N sub-regions and the quantity f,; with which we form the sum.) 
This theorem is almost trivial; for, since the numbers «;, tend uniformly to 
zero, the absolute value of the difference between the two sums 


x x 
Zh; AR, and U(f + €; y) AR, 


is less than ey D> AR;, and can be made as small as we please if we take the 
number N sufficiently large. E.g. if we have f(x, y) = P(x, y) Q(z, y) we may 
take f; = P;Q;, where P; and Q; are the maxima of P and Q in &, which are 
in general not assumed at the same point. 

9 (5912) 
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Yo=C, Yrs Yor --+ > Ym== 4 respectively, and through these 
points we draw parallels to the y-axis and the z-axis respectively. 
We then have N = nm. All the sub-regions are rectangles with 
area AR; = hk = Ardy, if we put h= Az, k= Ay. For the 
point (é,, 7,) we can take any point in the corresponding rect- 
angle, and we then form the sum 


2f(&e, n)AxAy 


for all the rectangles of the subdivision. 

If we now let n and m simultaneously increase beyond all 
bounds, the sum will tend to the integral of the function f over 
the rectangle R. 

These rectangles can also be characterized by two suffixes 
p and vy, corresponding to the co-ordinates s=a-+ vh and 
y=ce-+ pk of the lower left-hand corner of the rectangle in 
question. Here v assumes integral values from 0 to (nm — 1) and 
p from 0 to (m— 1). With this identification of the rectangles 
by the suffixes v and ~ we may appropriately write the sum as 


a double sum * 
1 


n—1 m— 
x = f(é,7,)AvAy. 
v=0 p=0 
Even when & is not a rectangle, it is often convenient to 
subdivide the region into rectangular sub-regions #,;. To do this 
we superpose on the plane the rectangular net formed by the 
lines 
e=vh (v=0,+1,+2,...) 
y=phe (p=0,4+1,42,...), 


where h and k are numbers chosen arbitrarily. We now consider 
all those rectangles of the division which lie entirely within R. 
These rectangles we call R,. Of course they do not completely 
fill the region; on the contrary, in addition to these rectangles R 
also contains certain regions R, adjacent to the boundary which 
are bounded partly by lines of the net and partly by portions of 
the boundary of R. But by the corollary on p. 225 we can cal- 
culate the integral of the function f over the region Ff by summing 
over the interior rectangles only and then passing to the limit. 


* If we are to write the sum in this way, we must suppose that the points 
(£, ,;) are chosen so as to lie in vertical or horizontal straight lines. 
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Another type of subdivision which is frequently applied is 
the subdivision by a polar co-ordinate net (fig. 3). Let the origin 
O of the polar co-ordinate system lie in the interior of our region. 
We subdivide the entire angle 27 into n parts of magnitude 


Fig. 3.— Subdivision by polar co-ordinate nets 


A@ = 2x/n = h, and we also choose a second quantity & = Ar. 
We now draw the lines 6 = vi(v = 0, 1, 2,..., ™— 1) through 
the origin and also the concentric circles r, = pk(u = 1, 2,...). 
Those which lie entirely in the interior of R we denote by R; 
and their areas by AR;. We can then regard the integral of the 
function f(x, y) over the region F as the limit of the sum 


Xf(é,, n:)AR; 


where (€;, 4;) is a point chosen arbitrarily in R, The sum is 
taken over all the sub-regions R, in the interior of R, and the 
passage to the limit consists in letting and & tend simultaneously 
to zero. 

By elementary geometry the area AR, is given by the equation 


AR, = 3(P 41 — 12h = 3(2e + 1A, 


if we assume that R, lies in the ring bounded by the circles 
with radii wk and (wu + Lk. 


3. Examples. 


The simplest example is the function f(z, y)= 1. Here the limit of 
the sum is obviously independent of the mode of subdivision and is always 
equal to the area of the region R. Consequently, the integral of the function 
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f(x, y) = 1 over the region is also equal to this area. This might have been 
expected, for the integral is the volume of the cylinder of unit altitude 
with the region R as base. 

As a further example we consider the integral of the function f(x, y) = 
over the square OS251,0SyS1. The intuitive interpretation of 
the integral as a volume shows that the value of our integral must be }. 
We can verify this by means of the analytical definition of the integral. 
We subdivide the rectangle into squares of side h= 1/n, and for the 
point (£,, ,) we choose the lower left-hand corner of the small square. 
Then each one of the squares in the vertical column whose left-hand 
side has the abscissa vk contributes the amount vh® to the sum. This 
expression occurs ” times. Thus the contribution of the whole column of 
squares amounts to nvh? = vh®. If we now form the sum from v= 0 
to v= n — 1, we obtain 


The limit of this expression as h —> 0 is 3, as we stated. 

In a similar way we can integrate the product ay, or more generally 
any function f(x, y) which can be represented as a product.of a function of 
x and a function of y in the form f(z, y)= 9(x)}(y), provided that the 
region of integration is a rectangle with sides parallel to the axes, say 


as2ezsb, 
eSysd. 


We use the same division of the rectangle as on p. 225, and for the value 
of the function in each sub-rectangle we take the value of the function 
at the lower left-hand corner. The integral is then the limit of the sum 


n—1 m—1 
hE YD &X o(vh)b(uk), 
=0 p=0 
which may also be written as the product of two sums in the form 
n—1 m—1 
(Sroom}) {"E ews}. 
v=0 u=0 


But in accordance with the definition of the ordinary integral, as h > 0 
and & —> 0 each of these factors tends to the integral of the corresponding 
function over the interval from a to 6 or from ¢ to d respectively. We 
thus obtain the general rule: if a function f(x, y) can be represented as a 
product of two functions o(x) and (y), tts double integral over a rectangle 
asxsb,cSy Sd can be resolved into the product of two integrals: 


He, ydzdy = [o(z)de. [yyyay. 
ty [etwas f 


In virtue of this rule and the summation rule (cf. p. 231) we can, for 
example, integrate any polynomial over a rectangle with sides parallel to 
the axes. 
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As a last example we consider a case in which it is convenient to use 
a subdivision by the polar co-ordinate net instead of a subdivision into 
rectangles. Let the region R be the circle with unit radius and centre the 
origin, given by a® + y* < 1, and let 


f(z y) = fl — 2 — y%) 


in other words, we wish to find the volume of a hemisphere of unit radius. 
We construct the polar co-ordinate net as before. From the sub- 
region lying between the circles with radii r,—= uk and r,41=(u+ Lk 


and between the lines 6 = vk and 0 = (v-+ 1)h (a= = =*) we obtain the 
contribution 


1 rT, 
b= Teale Te Tata Fray (Au41— 12h = V1 = pF eukh, 


where for the value of the function in the sub-region R, we have taken 
the value which the function assumes on an intermediate circle with the 
Totti lp 
aur ae 
the same contribution, and since there are n = 2x/h such regions the 
contribution of the whole ring is 


2nV1 — p,? p,k 


The integral is therefore the limit of the sum 


radius p, = All sub-regions which lie in the same ring give 


m—1 
x 2n V1 — e,? ok, 


u=0 


and, as we already know, this sum tends to the single integral 


So ee 1 
anf rV/1 = Par = -=-v(l—*7 aL- > 
0 


we therefore obtain 
ffvi Sepa 
- 3 


in agreement with the known formula for the volume of a sphere. 


4. Notation. Extensions. Fundamental Rules. 


The rectangular subdivision of the region R is associated with 
the symbol for the double integral which has been in use since 
Leibnitz’s time. Starting with the symbol 

n—iLm—1 


x = f(§, n,)Avdy 


v=0 pO 
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for the sum over the rectangles, we indicate the passage to the 
limit, from the sum to the integral, by replacing the double 
summation sign by a double integral sign and writing the symbol 
dz dy instead of the product of the quantities Ax and Ay. Accord- 
ingly, the double integral is frequently written in the form 


Jf fe ydady 


[fe as 


in which the area of AR is replaced by the symbol dS. We again 
emphasize that the symbol drdy does not mean a product, but 
merely refers symbolically to the passage to the limit of the 
above sums of mm terms as n > oo and m—> o. 

It is clear that in double integrals, just as in ordinary integrals 
of a single variable, the notation for the “variables of integra- 
tion ”’ is immaterial, so that we could equally well have written 


f i f(u, v)dudv or f i f(E n)dédn. 


In introducing the concept of integral we saw that for a 
positive function f(x, y) the integral represents the volume under 
the surface z= f(z, y). In the analytical definition of integral, 
however, it is quite unnecessary that the function f(x, y) should 
be positive everywhere; it may be negative, or it may change 
sign, in which last case the surface intersects the region R. Thus 
in the general case the integral gives the volume in question with 
a definite sign, the sign being positive for surfaces or portions of 
surfaces which lie above the zy-plane. If the whole of the surface 
corresponding to the region FR consists of several such portions, 
the integral represents the sum of the corresponding volumes 
taken with their proper signs. In particular, a double integral 
may vanish although the function under the integral sign does 
not vanish everywhere. 

For double integrals, as for single integrals, the following 
fundamental rules hold, the proofs being a simple repetition 
of those in Vol. I (p. 81). If c¢ is a constant, then 


[fete nas=ef [ fe yas. 


instead of in the form 
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Also, 
[ffent d@ mds=ff fe, nas+f fs, yas, 


that is: the integral of the sum of two functions ts equal to the sum 
of their two integrals. Finally, if the region R consists of two sub- 
regions R’ and R” that have at most portions of the boundary 
in common, then 


[ffens=f fre nas+f f fe nas, 


that is: when regions are joined together the corresponding integrals 
are added. 


5. (Integral Estimates and the Mean Value Theorem. 


As in the case of one independent variable, there are some 
very useful estimation theorems for the double integral. Since 
the proofs are practically the same as those of Vol. I, Chap. II, 
section 7 (p. 126), we shall here be content with a statement of 
the facts. 

If f(z, y) = 0 in R, then 


J [ fe nas = 0; 
similarly, if f(z, y) < 0, 
f f f(x, y)dS <0. 


This leads to the following result: 
If the inequality 
f(z, y) = $(@, 9) 


holds everywhere in R, then 
[fe nas =f [de yas. 


A direct application of this theorem gives the relations 


[ftenassf [|r n|as 
[ffenas=—f {te y)\as. 


and 
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We can also combine these two inequalities in a single formula: 


lf fre nas| sf f|re nas. 


If m is the lower bound and M the upper bound of the values 
of the function f(x, y) in R, then 


mAR < f f f(x, y) dS < MAR, 


where AR is the area of the region R. The integral can then be 
expressed in the form 


J [f@ yds = par, 


where yu is a number intermediate between m and M, the exact 
value of which cannot in general be specified more exactly.* 
This form of the estimation formula we again call the mean 
value theorem of the integral calculus. 
Here again the following generalization holds: if p(x, y) is 
an arbitrary positive continuous function in FR, then 


J [pe nie nas = vf f re, a8, 


where uz denotes a number between the greatest and least values 
of f, which cannot be further specified. 

These integral estimates show as before that the integral 
varies continuously with the function. More precisely, if f(x, y) 
and ¢(z, y) are two functions which satisfy the inequality 


| f(a, Yy) — (2, y)|<e«, 


where ¢ is a fixed positive number in the whole region R with area 


AR, then the integrals f f f(a, y)dS and f i dla, y)dS differ by 
R R 


less than «AR, that is, by less than a number which tends to 
zero with e. 

In the same way we see that the integral of a function varies 
continuously with the region. For suppose that two regions R’ 
and R” are obtained from one another by the addition or removal 
of portions whose total area is less than «, and suppose that 

* Just as in the case of continuous functions of one variable, we can state 


that the value p» is certainly assumed at some point of the region R by the 
continuous function f (x, y). 
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J (&, y) is a function which is continuous in both regions and such 
that | f(z, y) |< M, where M is a fixed number. Then the two 


integrals f i J (z, y) dS and f F(z, y) dS differ by less than Me, 
Rr Rr 


that is, by less than a number which tends to zero with e. 
The proof of this fact follows at once from the last theorem of 
the preceding sub-section. 

We can therefore calculate the integral over a region R as 
accurately as we please by taking it over a sub-region of R whose 
total area differs from the area of R by a sufficiently small amount. 
For example, in the region R we can construct a polygon whose 
total area differs by as little as we please from the area of R. 
In particular, we may suppose this polygon to be bounded by 
lines parallel to the z- and y-axes alternately, that is, to be pieced 
together out of rectangles with sides paraliel to the axes. 


6. Integrals over Regions in Three and More Dimensions. 


Every statement we have made for integrals over regions of 
the zy-plane can be extended without further complication or 
the introduction of new ideas to regions in three or more dimen- 
sions. If e.g. we consider the case of the integral over a three- 
dimensional region R, we have only to subdivide this region R 
by means of a finite number of surfaces with continuously varying 
tangent planes into sub-regions which completely fill R and 
which we denote by Ry, R,,..., Ry. If f(z, y, 2) is a function 
which is continuous in the closed region R, and if (,, 7; ¢;) 
denotes an arbitrary point in the region R,, we again form the sum 


Zs (Es ne» S)AR,, 


in which AR, denotes the volume of the region R;. The sum is 
taken over all the regions R,, or, if it is more convenient, only 
over those sub-regions which do not adjoin the boundary of R. 
If we now let the number of sub-regions increase beyond all 
bounds in such a way that the diameter of the largest of them 
tends to zero, we again find a limit independent of the particular 
mode of subdivision and of the choice of the intermediate points. 
This limit we call the integral of f(x, y, z) over the region R, and 
we denote it symbolically by 


[ff fe y, z)dV. 


9 (8912) 
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If, in particular, we effect a subdivision of the region into 
rectangular regions with sides Ax, Ay, Az, the volumes of the 
inner regions R, will all have the same value ArAyAz. As on 
p. 230, we indicate the possibility of this type of subdivision and 
the passage to the limit by introducing the symbolic notation 


Sf [fe y, 2) dady dz 


in addition to the one above. All the facts which we have men- 
tioned for double integrals remain valid for triple integrals apart 
from the necessary changes in notation. 
For regions of more than three dimensions the multiple 
integral can be defined in exactly the same way, once we have 
suitably defined the concept of volume for such regions. If 
in the first instance we restrict ourselves to rectangular regions 
and subdivide these into similarly oriented rectangular sub- 
regions, and if we further define the volume of a rectangle 


S454, +h, AgS%y Sat hy... Gp Sin Sant hg, 


as the product hh, ...h,, the definition of integral involves 
nothing new. We denote an integral over the »-dimensional 
region & by 


ff eee f fe Ug os sy Lp) At, ky... diby. 


For more general regions and more general subdivisions we must 
rely on the abstract definition of volume which we shall give 
in section 1 of the appendix (p. 287). 

In what follows, apart from section 3 of the appendix, we 
can confine ourselves to integrals in at most three dimensions. 


7. Space Differentiation. Mass and Density. 


In the case of single integrals and functions of one variable, 
we obtain the integrand from the integral by a process of dif- 
ferentiation, taking the integral over an interval of length h, 
dividing by the length h, and then letting h tend to zero. For 
functions of one variable this fact represents the fundamental 
connexion between the differential calculus and the integral 
calculus, and we interpreted it intuitively in terms of the concepts 
of total mass and density. For the multiple integrals of functions 
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of several variables the same connexion exists; but here it is 
not so fundamental in character. 
We consider the multiple integral (domain integral) 


[ [fe y)adS or [fff y, 2) dV 


of a continuous function of two or more variables over a region B 
which contains a fixed point P with co-ordinates (x, ¥,)—or 
(%p, Yo: 2%), 8 the case may be—and which has the content * 
AB. If we then divide the value of this integral by the content 
AB, it follows from the considerations of sub-section 5 (p. 232) 
that the quotient will be an intermediate value of the integrand, 
that is, a number between the greatest and the least values 
of the integrand in the region. If we now let the diameter of 
the region B about the point P tend to zero, so that the content 
AB also tends to zero, this intermediate value of the function 
I(x, y)—or f(x, y, 2}—must tend to the value of the function at 
the point P. Thus the ‘aes to the limit yields the relations 


lm [fe y) dS = f (Zo, Yo) 


and 


sim al Sl s@ y, 2)dV = f (Xp; Yo; %o)- 


This limiting process, which corresponds to the differentiation 
described above for integrals with one independent variable, we 
call the space differentiation of the integral. We see, then, that the 
space differentiation of a multiple integral gives the integrand. 


This connexion enables us to interpret the relation of integrand to 
integral in the case of several independent variables, as before, by means 
of the physical concepts of density and total] mass. We think of a mass of 
any substance whatever as distributed over a two- or three-dimensional 
region R in such a way that an arbitrarily small mass is contained in each 
sufficiently small sub-region. In order to define the specific mass or density 
at a point P, we first consider a neighbourhood B of the point P with 
content AB, and divide the total mass in this neighbourhood by the content. 
The quotient we shall call the mean density or average density in this sub- 
region. If we now let the diameter of B tend to zero, from the average 
density in the region B we obtain in the limit the density at the point P, 


* The word content is used as a general word to include the idea of length 
in one dimension, area in two dimensions, volume in three dimensions, and 
so on. 
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provided always that such a limit exists independently of the choice 
of the sequence of regions. If we denote this density by u(x, y)—or by 
u(x, y, 2)—and assume that it is continuous, we see at once that the process 
described above is simply the space differentiation of the integral 


J fo nas, 
ini } H(z, y, z)dV, 


taken over the whole region R. This integral taken over the whole region 
therefore gives us the total mass of the substance of density u in the 
region * R. 

From the physical point of view such a representation of the mass of 
a substance is naturally an idealization. That this idealization is reasonable, 
i.e. that it approximates to the actual situation with sufficient accuracy, 
is one of the assumptions of physics. 

These ideas, moreover, retain their mathematical significance even 
when y. is not positive everywhere. Such negative densities and masses 
may also have a physical interpretation, e.g. in the study of the distribution 
of electric charge. 


or 


3. Repuction or THE MuttieLtE INTEGRAL TO 
RepeateD SINGLE INTEGRALS 


The fact that every multiple integral can be reduced to single 
integrals is of fundamental importance in the evaluation of 
multiple integrals. It enables us to apply all the methods which 
we have previously developed for finding indefinite integrals to 
the evaluation of multiple integrals. 


l. Integrals over a Rectangle. 


In the first place we take the region R asa rectanglea < x < 5, 
a Sy S B in the zy-plane, and we consider a continuous function 
f(z, y) in R. In Vol. I, Chap. X (pp. 490-1) we used a process of 
cutting the volume under the surface z= f(z, y) into slices in 
order to make the following statement appear plausible: 


* What we have shown here is that the distribution given by the multiple 
integral has the same space-derivative as the mass-distribution originally 
given. It remains to be proved that this implies that the two distributions are 
actually identical; in other words, that the statement “space differentiation 
gives the density y»” can be satisfied by only one distribution of mass. The 
proof, which is not difficult, is passed over here. (It closely resembles the proof 
of the Heine-Borel covering theorem.) 
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To find the double integral of i(x, y) over the region R, we first 


regard y as constant and integrate f(x, y) with respect to x between 
the limits a and b. This integral 


Hy) =f fle de 


is a function of the parameter y, and we have then to integrate it 
between the limits a and B. In symbols, 


Jf fe vas=f'surdy, $y =f fe, ade, 
or, more briefly, 
Jf fee nas = fay f° fee, de. 


In order to prove this statement analytically, we return to 
the definition of the multiple integral on p. 226. Taking 


sf and k= eae. 
m 


pee 
we have 


J [fe yas = lim & E fla + ph, a+ vk) hk, 


mee yelywl 


where the limit is to be understood to mean that the sum on the 
right-hand side differs from the value of the integral by less than 
an arbitrarily small pre-assigned positive quantity «, provided 
only that the numbers m and n are both larger than a bound * 
N depending only on «. By introducing the expression 


©, =S fiat pha+ ih 
wc 
we can write this sum in the form 


EO, k. 


v=1 


1 1 
If we now choose an arbitrary fixed value for ¢, e. or ——_ 
sf » °8 700 °* 70,000" 
* The root idea of the following proof is simply that of resolving the double 
limit as m and n increase simultaneously into the two successive single limiting 
processes, first m -—> o when n is fixed and then n —> 0 (cf. Chap. II, Appendix, 
section 2 (p. 103)). 
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and for n choose any definite fixed number greater than N, we 
know that 


i f fla, y)dS — kEO, <e 


no matter how large the number m is, provided only that it is 
greater than NV. If we keep m fixed during the limiting process, 
the above expression will never exceed e. In accordance with 
the definition of the ordinary integral, however, in this limiting 
process the expression ®, tends to the integral 


A 
[fat vk)de= $(a + vb), 
and we therefore obtain 


f Jf (a, y)dS — bE $a + vk)| Se. 


For arbitrarily small values of « this inequality holds for all 
values of n which are greater than a fixed number N depending 
only on «. If we now let n tend to o (ie. let & tend to zero), 
then by the definition of the single integral and the continuity of 


[fle de = $y) we obtain 
lim kE pla + vb) = ['dy) dy, 


n> ym 
whence 


| ff ee nas —fderdy 


Since ¢ can be chosen as small ag we please and the left-hand 
side is a fixed number, this inequality can only hold if the left- 
hand side vanishes, i.e. if 


Jf fe yas = f'ayf fe, yaa. 


This gives the required transformation. 

This result accordingly reduces double integration to the per- 
formance of two successive single integrations. The double integral 
can be represented as a repeated single integral. 

Since the parts played by x and y are interchangeable, no 
further proof ie required to show that the equation 


Se 
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[fe nas=facf fe, nay 


2. Results. Change of Order of Integration. Differentiation 
under the Integral Sign. 


is also true. 


From the last two formule of the preceding sub-section we 
obtain the relation 


[auf Fle yao= faef fe, nay 


or, in words: 

In the repeated integration of a continuous function with constant 
limits of integration the order of integration can be reversed. 

This theorem can also be stated as follows: 

If the function £(x, y) is continuous in the closed rectangle, then 
in this rectangle we can perform the integration of the integral 

b 
f f(x, y)dx with respect to the parameter y by integrating with 
respect to y under the integral sign, that is, by integrating first with 
respect to y and then with respect to x. 

This theorem corresponds exactly to the rule for the differen- 
tiation of an integral with respect to a parameter (cf. section 1, 
p. 219). 

We obtain a further result if we regard one of the above 
limits, say b, as a variable parameter. We can then differentiate 
the double integral with respect to this parameter; by the funda- 


mental theorem of the differential and integral calculus we obtain 
the result 


a 8 
x J Lf @ ydedy = ff, vdy. 
Similarly, if we regard 8 as a variable parameter we obtain 
fini | 
apd [fe vdedy = J fle Bde. 
Finally, from the two equations we obtain 
oO? 
es dady = f (6, 
apap | Jf ydedy = S06, B) 
by repeated differentiation. 


240 MULTIPLE INTEGRALS (Crap. 


In other words: 

Differentiation of the integral with respect to one of the wpper 
limits leads to an ordinary integral over the corresponding side of 
the rectangle; mixed differentiation with respect to the two upper 
limits gives the integrand at the corresponding corner of the rectangle.* 

The theorem on the change of order in integration. has many 
applications. In particular, it is frequently used in the explicit 
calculation of simple definite integrals for which no indefinite 
integral can be found. 


As an example—for further examples see the appendix, section 3, 
pp. 313-6—we consider the intogral 


Cy = 
e7ae __ go bn 
I= f oa oe 
0 we 


which converges for a > 0,8 > 0. We can express J as a repeated integra] 
in the form 
00 ab 
t= f dx i] eV dy, 
0 a 


In this improper repeated integral we cannot at once apply our theorem 
on change of order. If, however, we write 


r b 
I=lim [ de f ev dy, 
r>w JQ a 
by changing the order of integration we obtain 
I= lim 
. T—>xda 
Since in virtue of the relation 


be Ty Td e-v 
[w= <— dy 
(4 y Ta y 


by. ew ty bo-Ty 
l—e b f e age 
a 


adele aot i 


the second integral tends to zero as T increases, we have 
CO p—-a% _. g—be b 
Tm f oS de = log-- 
0 x a 


In a similar way we can prove the following general theorem: if f(é) is 
00 
sectionally smooth for ¢ = 0, and if the integral if £0 dt exists, then 
1 
00 
_ b 
1=f flax) — fbx) 4, — #00) log? (a> 0, b> 0). 
0 x a 
* The reader’s attention may be drawn to the connexion between this 


formula and the theorem on change of order of differentiation (cf. p. 55); he 
should investigate for himself to what extent the two facts are equivalent. 
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For here we can again express the single integral as a repeated integral 


00 a 
T= faz {feu)dy 
0 b 
and change the order of integration. 


3. Extension of the Result to More General Regions. 


By a simple extension of the results already obtained we 
can prove that our result holds for regions more general than 
rectangles. We begin by considering a convex region R, that is, a 
region whose boundary curve is not cut by any straight line 
in more than two points unless the whole straight line between 


Fig. 4.—General convex region of integration 


these two points is a part of the boundary (fig. 4). We suppose 
that the region lies between the “lines of support” (cf. ex. 1 (6), 
p. 100) z= a, >= a, and y= Yo, y= y, respectively. Since 
for points of R the x-co-ordinate lies in the interval v7) S ¢ S 2, 
and the y-co-ordinate in the interval yy Sy Sy, we consider 
the integrals 


daly) 
[fe yaa 
dil) 


and 


alo) 
f I(x, y)dy, 


Ya(x) 


which are taken along the segments in which the lines y = const. 
and x = const. respectively intersect the region. Here ¢,(y) and 
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¢,(y) denote the abscisse of the points in which the boundary 
of the region is intersected by the line y= const., and ¢,(z) 
and ¢,(z) the ordinates of the points in which ne i pounder is 


intersected by the lines <= const. The integral f° fl (z, y)da is 


therefore a function of the parameter y, where dhe parameter 
appears both under the integral sign and in the BpRer and lower 


limits, and a similar statement holds for the integral i 7 (a, y) dy 


as a function of x. The resolution into repeated intel is then 
given by the equations 


J [tte nas = fay" Hee, yas 


To prove this we first choose a sequence of points on the are 
y = (x), the distance between successive points being less than 


Fig. 5 


a positive number 6. We join successive points by paths each 
consisting of a horizontal and a vertical line-segment, lying in R. 
The lower boundary y= %,(x) we treat similarly. We thus 
obtain a region R in R, consisting of a finite number of rectangles. 
The boundary of & above and below is represented by sectionally 
continuous functions y= #,(x) and y= y,(x) respectively 
(cf. fig. 5). By the known theorem for rectangles we have 


f [fe pas=f “ae if ee, yde. 
B aid ¥alx) 
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Since %,(z) and ,(z) are uniformly continuous, as 5 >0 the 
functions y,(z) and (x) tend uniformly to ¢¥,(z) and ¢,(zx) 
respectively, and so 


lim i vite y)dy = =|" Te y)dy 


uniformly in z. It follows that 


Wa( 20) ey a(x) 
ae x, y)du= | dx x, y)da. 
Ava lim fe a it y) im I df y) 
On the other hand, as ; ~> 0 the region @ tends to R. Hence 


Jim ff fla, yas =f ffte, nas. 
Combining the three equations, we have 
1 Ha(%) 

[ff nas=f def fle, aay. 


The other statement can be established in a similar way. 
A similar argument is available if we abandon the hypothesis 


Fig. 6.—Non-convex cegions of integration 


of convexity and consider regions of the form indicated in fig. 6. 
We assume merely that the boundary curve of the region is 
intersected by every parallel to the z-axis and by every parallel 
to the y-axis in a bounded number of points or intervals. By 
f J (x, y)dy we then mean the sum of the integrals of the function 


f(x, y) for a fixed x, taken over all the intervals which the line 
z= const. has in common with the closed region. For non- 
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convex regions the number of these intervals may exceed unity. 
It may change suddenly at a point «= € (as in fig. 6, right) in 
such a way that the expression if T(z, y) dy has a jump discontinuity 


at this point. Without essential changes in the proof, however, 
the resolution of the double integral 


[ [fe nas=f def fe, dy 


remains valid, the integration with respect to x being taken along 
the whole interval x,< 2 <2, over which the region R lies. 
Naturally the corresponding resolution 


[ff yas = f dy f fe, y)da 


also holds. 


Fig. 7.—Circular ring as region of integration 


If e.g. the region is the circle (fig. 7) defined by 27+ y? <1, then the 
resolution is as follows: 


J [se nas = =f def ar: v)dy. 


va— 


If the region is a circular ring between the circles 27+ y2= 1 and 
a+ y? = 4 (fig. 7), then 


J fs waedy = fas sf aC nay + f af ite v)dy 


SE Faomf “ef Fan 
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As a final example we take as the region F a triangle (fig. 8) bounded 
by the lines x = y, y= 0, and z= a(a > 0). If we integrate first with 


respect to 2, 
Jf [1 nas = [av f He, nde 


and if we integrate first with respect to y, 


Jf ft nas = [ae fp, nav. 


¥ 


¢) 
=a £ 


Fig. 8.—Triangle as region of integration 
Comparing the two results, we have 
a xe a a@ 
[ef te nay = fay f Fe nae. 
0 0 0 vy 
In particular, if f(z, y) depends on y only, our formula gives 


f “de f “fy)dy = f “fly) (a — y)dy. 


x 
From this we see that if the indefinite integral ii f(y) dy of a function f(a) 
0 
is integrated again, the result can be expressed by asingle integral (of. p. 221). 


4. Extension of the Results to Regions in Several Dimensions. 


The corresponding theorems in more than two dimensions 
are so closely analogous to those already given that it will be 
sufficient to state them without proof. If we first consider the 
rectangular region % [7% ySySy, %™S2%S%, anda 
function f(x, y, z) which is continuous in this region, we can 
reduce the triple integral 


v=ffffey, z)dV 
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in several ways to single integrals or double integrals. Thus 


ff i f(a, y, 2)dV = f “ae i iS f(a, y, 2)dady. 


[ffl naedy 


is the double integral of the function taken over the rectangle 
% StSYy, % Sy SH, 2 being kept constant as a parameter 
during this integration, so that the double integral is a function 
of the parameter z, Either of the remaining co-ordinates x and y 
can be singled out in the same way. 

Moreover, the triple integral V can also be represented as a 
repeated integral in the form of a succession of three single 
integrations. In this representation we first consider the expression 


[Prenat 


x and y being fixed, and then consider 


v1 
d; (x, y, 2) dz, 
Loses 
z being fixed. We finally obtain 


V =a f'ay [Fee y, 2) dz. 


In this repeated integral we could equally well have integrated 
first with respect to x and then with respect to y and finally 
with respect to z, or we could have made any other change in 
the order of integration; this follows at once from the fact that 
the repeated integral is always equal to the triple integral. We 
therefore have the following theorem: 

A repeated integral of a continuous function throughout a 
closed rectangular region is independent of the order of integration. 


Here 


The way in which the resolution is to be performed for non-rectangular 
regions in three dimensions scarcely requires special mention. We content 
ourselves with writing down the resolution for a spherical region 2* + y? +- 
2s: 


+1 +V(1—x) + V (1—x3—y?) 
[ff te y, z)dzdydz -{ dx f dy f S(x,y, 2)az. 
& -1 —VQ—x) MV (—st_y 
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ExaMPLES 
Evaluate the integrals in Ex. 1-8: 


1. f fevanay over the circle 27 + y? S1. 
a + y> — Bay(2? + 9") 
2. [Sr dady over the circle x? fk 
ery , we 

3. f. ‘ i (a? + y? + 2*)xyzdxdydz throughout the sphere 2? + y? + 2? 
Sr. 

4. zdadydz throughout the region defined by the inequalities 
@+ySs2,aty+2asi, 

f f (2 + y + 2)a*y*z* dx dy dz throughout the region + y+2S1, 

2£20,¥20,z220. 


d. 
6. f if f ay eae = , throughout the sphere 27 + y? + 251. 


dxdy dz 
i SS Saree ee + y? + (2 — 4)? throughout the sphere 27+ y?+ 221. 


d 
8. UaFS = 2 = ore Series (2) eae t 
9. Prove that if # (z) is a continuous function 


. +1 os 
tim male) = nf(0). 


4, TRANSFORMATION OF MULTIPLE INTEGRALS 


In the case of single integrals the introduction of a new 
variable of integration is one of the chief methods for trans- 
forming and simplifying given integrals. The introduction of 
new variables is likewise of great importance in the case of several 
variables. In the case of multiple integrals, in spite of their 
reduction to single integrals, explicit evaluation is generally 
more difficult than in the case of one independent variable, and 
the integration in terms of elementary functions is less often 
possible. Yet in many cases we can evaluate such integrals by 
introducing new variables in place of the original variables under 
the integral sign. Quite apart from the question of the explicit 
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evaluation of double integrals, the change of variables is of 
fundamental importance, since the transformation theory gives 
us a more complete mastery of the concept of integral. 

The most important special case is the transformation to polar 
co-ordinates, which has already been carried out in Vol. I, Chap. 
X (p. 494). Here we shall at once proceed to general transforma- 
tions. We first consider the case of a double integral 


/ if (a, y)ds =f [ fle, y)dady, 


taken over a region R of the zy-plane. Let the equations 


t= do(u, v) 
y = ou, 2) 


give a one-to-one mapping of the region R on the closed region 
R’ of the uv-plane. We assume that in the region R’ the functions 
¢@ and # have continuous partial derivatives of the first order 
and that their Jacobian 


du ‘|= es 
‘ns gute — Pude 
never vanishes in the closed region R’; to be specific, we assume 
that it is everywhere positive. We then know that with these 
assumptions the system of functions = ¢(u, v), y== ¥(u, v) 
possesses a unique inverse w= g(x, y), v= A(z, y) (p. 152). 
Moreover, the two families of curves u == const. and v = const. 
form a net over the region R. 

Heuristic considerations readily suggest how the integral 


D= 


f f S(&, y)dxdy can be expressed as an integral with respect 

to u and v, We naturally think of calculating the double integral 

, f J (x, y)dS by abandoning the rectangular subdivision of the 
Rg 


region and instead using a subdivision into sub-regions R, by 
means of curves of the net u = const. or v= const. We there- 
fore consider the values u= vh and v= yk, where h == Au and 
k = Av are given numbers and v and p take all integer values 
such that the lines w= vk and v= yk intersect R’ (so that 
their images are curves in R). These curves define a number 
of meshes, and for the sub-regions R, we choose those meshes 
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which lie in the interior of R (figs. 9,10). We now have to 
find the area of such a mesh. 
If the mesh, instead of being bounded by curves, were an 


x 


Figs. 9, 10.—Decomposition of regions in a transformation 


ordinary parallelogram, half the parallelogram being formed by 
the triangle with the vertices corresponding to the values 
(u,, v,), (u, + h, v,), and (u,, v,-+ k), then by a formula of ele- 
mentary analytical geometry (cf. Chap. I, p. 14) the area of 
the parallelogram would be given by the determinant 


d(u, +b h, Vy) a d(u,, %,) P(U,, Oy + k) a f(u,, v,) 
plu, > h, v,) = b(u,, v,) $(u,, Uy + k) — pu,, v,) ; 


which is approximately equal to 


dult,, v,) a v,) 
Pulty, Y,) Pole,» %,) 


On multiplying this expression by the value of the function f in 
the corresponding mesh, summing over all the regions R, lying 
entirely within R, and then performing the passage to the limit 
h— 0 and k->0, we obtain the expression 


J Jf (¢(u, v), #(u, v)) Ddudo 


for the integral transformed to the new variables. 

This discussion is incomplete, however, since we have not 
shown that it is permissible to replace the curvilinear meshes 
by parallelograms or to replace the area of such a parallelo- 
gram by the expression (4,2, — Wud,)hk; that is, we have not 
shown that the error thus caused vanishes in the limit as h > 0 
and k->0. Instead of completing this method of proof by 


hk = hkD. 
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making these estimates, we prefer to develop the proof of the 
transformation formula in a somewhat different way, which can 
subsequently be extended directly to regions of higher dimensions. 


v 


Fig. 11 Fig. 12 


For this purpose we use the results of Chap. III, section 3 
(p. 150) and perform the transformation from the variables x, y 
to the new variables uv, v in two steps instead of in one. We 
replace the variables z, y by new variables 2, v by means of the 

equations 
L=2 
Y~ Oleh 2) y = Dv, 2). 


Here we assume that the expres- 
sion ®, vanishes nowhere in the 
region R, ie. that ©, is every- 


Y- Dept) where greater than zero, say, 
and that the whole region R can 
be mapped in a one-to-one way 

teLy Xm Lyth on the region B of the av-plane. 
Fig. 13 We then map this region B in 


a one-to-one way on the region 
R’ of the uv-plane by means of a second transformation 


x= Y(u, v) 

v=, 
where we further assume that the expression V, is positive 
throughout -the region B. We now effect the transformation of 


the integral f f S(a, y)dady in two steps. We start with a sub- 
g 
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division of the region B into rectangular sub-regions of sides 
Az =h and Av=k bounded by the lines = const. = 2, and 
v= const.= v, in the av-plane. This subdivision of B corresponds 
to a subdivision of the region R into sub-regions R;, each sub-region 
being bounded by two parallel lines = z, and x= x, + h and 
by two arcs of curves y= O(v,, 2) and y= O(v, + &, x) (figs. 11, 12). 
By the elementary interpretation of the single integral, the 
area of the sub-region (fig. 13) is 


uth 
AR,=f[ [®(v, +k, 2) — @,, 2))}dx, 
which by the mean value theorem of the integral calculus can be 
written in the form 
AR, = h{P(v, + k, £,) — P(v,, 2), 
where Z, is a number between z, and z,-+ h. By the mean value 
theorem of the differential calculus this finally becomes 
AR; = hk®,(6,, Z,), 


in which 6, denotes a value between v, and v,+ k, so that 
(6,, %,) are the co-ordinates of a point of the sub-region in B 
under consideration. The integral over & is therefore the limit 
of the sum 


LfAR, = Thkf (Z,, OG, £,))©,(0,, Z,) 


as h—>0,k—>0. We see at once that the expression on the right 
tends to the integral 


[fe YO.dedv (y= Ol, 2) 
B 
taken over the region B. Therefore 


J Jf (2, y)dady = f J f(a, y)®, dade. 


To the integral on the right we now apply exactly the same 
argument as that just employed for 7 f S (x, y) da dy, transforming 
R 


the region B into the region R’ by means of the equations 
z= Pu, v), v=. 
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The integral over B then becomes an integral over R’ with 
an integrand of the form f®,¥,,, and we finally obtain 


a fe fla, y)®,¥, dud. 


Here the quantities z and y are to be expressed in terms of the 
independent variables u and » by means of the two transforma- 
tions above. We have therefore proved the transformation 
formula 


| f f@, ydady =f f fle, YOY dudr. 
R R 
By introducing the direct transformation z = ¢(u, v), y= o(u, v) 
the formula can at once be put in the form stated previously. 
For O(e ¥) _ ®, and A(z, ©) = Y,, and so by Chap. III, section 3 


“Aa, 0)” O(u, v) 
(p. 147) we have 


p=”) _ ow, 
O(u, v) 


We have therefore established the transformation formula for all 
cases in which the transformation z= ¢(u, v), y = y(u, v) can 
be resolved into a succession of two primitive transformations of 
the forms * «= 2, y= O(v, z) and v=», = ¥(u, »). 

In Chap. ITI, section 3 (p. 151), however, we saw that we can 
subdivide a closed region R into a finite number of regions in 
each of which such a resolution is possible, except perhaps that 
it may also be necessary to replace u by v and v by —u; this 
substitution is merely a rotation of the axes, and we see that it 
does not affect the value of the integral; in fact, even the simple 
heuristic argument at the beginning of this sub-section is perfectly 
rigorous for this case. We thus arrive at the following general 
result:f 

* We have assumed above that the two derivatives , and W,, are positive, 
ae Ws easily see that this is not @ serious restriction. For the inequality 
aus ) 
were both negative, we should merely have to replace x by —x and y by —y, 


which leaves the integral unchanged. The two primitive transformations then 
have positive Jacobians. 

t The above proof in the first instance holds only for every closed region 
R, lying entirely within R. Since, however, R, can be chosen so as to occupy 
all of & except a portion of arbitrarily small area, the transformation formula 
continues to hold for F itself. 


> 0 shows that these two derivatives must have the same sign. If they 
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If the transformation x = ¢(u, v), y = ou, v) represents a 
continuous one-to-one mapping of the Tose region R of the xy-plane 
on a region R’ of the uv-plane, and if the eid d and ys have 


continuous first derivatives and their Jacobian ey y) = puty— Puby 
is everywhere positive, then (u, ¥) 


Jf fe vydedy =f f 706, 0), He, » SE: 


For completeness we add that the transformation formula 


remains valid if the determinant oe vanishes without, how- 
ever, changing its sign, at a finite number of isolated points of 
the region. For then we have only to cut these points out of & 
by enclosing them in small circles of radius p. The proof is valid 
for the residual region. If we then let p tend to zero,* the trans- 
formation formula continues to hold for the region R in virtue 
of the continuity of all the functions involved. 

We make use of this fact whenever we introduce polar co- 
ordinates with the origin in the interior of the region; for the 
Jacobian, being equal to r, vanishes at the origin. 

In Chap. V, section 4 (p. 377) we shall return to transforma- 
tions with negative Jacobians, and we shall see that the argument 
remains essentially the same. Nevertheless, we would point out 
here that provided the Jacobian D does not vanish the hypothesis 
D > 0 in a sense involves no loss of generality, for we can always 
change the sign of D by interchanging u and v. A different 
method of proving the transformation formula will be given in 
Chap. V, § 3, p. 373. 


iy dudv. 


Regions of more than Two Dimensions. 


We can of course proceed in the same way with regions of 
more than two dimensions, e.g. with regions in three-dimensional 
space, and obtain the following general result: 

If a closed region R of xyz. . . -space is mapped on a region R’ 
of uvw... -space by a one-to-one transformation whose Jacobian 


O(a, Y, 2, ~~») 
O(u, v, w,.. .) 


* For another application of this method, see section 5, p. 262. 
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ts everywhere positive, then the transformation formula 


Sf [fem )dedyde... 
= ff [femne..) 22%) dudodw... 


o(u, v, W,.. ) 


holds. In n dimensions the Jacobian is an n-rowed deter- 
minant of similar construction to the Jacobian in two 
dimensions. 


As a special application, we can obtain the transformation formula for 
polar co-ordinates in another way. In the case of polar co-ordinates in the 
plane we must write r and 6 instead of u and v, and we at once obtain 


the expression oe 7H == r (cf. p. 144). In the case of polar co-ordinates 


tn space, defined by the equations 
“= rcosg sin® 
y= rsing sin® 
z= rcosé, 


in which > ranges from 0 to 27, 6 from 0 to x, and r from 0 to + 0, we 

must identify u, v, w with 7, 0, 9; as the expression for the Jacobian we 

obtain 

cos sin® = rcospcos@ —rsingsin® 

sing sinO§ =r sing cos@ r cos@ sin® | = r? sin 8. 
cos 6 —r sind 0 


O(n, ¥, 2) _ 
Or, 8, —) 


(This value +? sin®@ is obtained by expanding in terms of the elements of 
the third column.) The transformation to polar co-ordinates in space is 
therefore given by the formula 


ps af f(a, y, 2)dedydz = f f (e f(x, y, 2)7 sin@drdédo. 
R 


As in the corresponding case in the plane, we can also arrive at the trans- 
formation formula without using the general theory. We have only to 
start with a subdivision of space given by the spheres r = const., the 
cones § = const., and the planes ¢ = const. The details of this elementary 
method are similar to those of Vol. I, Chap. X, section 2 (p. 494) and can 
be left to the reader. 

In the case of polar co-ordinates in space our assumptions are not 
satisfied when r= 0 or 6 = 0, since the Jacobian then vanishes. The 
validity of the transformation formula, however, is not thereby destroyed. 
We can easily convince ourselves of this, as we did in the case of the 
plane. 
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EXAMPLES 
Yn-% 
1*, Evaluate the integral f f e’+2dxdy taken over the triangle with 
vertices (0, 0), (0, 1), (1, 0). 
2. Evaluate the integral 


faze dady 
i, (L-+ a+ y°)* 
taken 


(a) over one loop of the lemniscate (2? + y?)? — (x7 — y*) = 0, 
(b) over the triangle with vertices (0, 0), (2, 0), (1, V3). 


3. Evaluate the integral 


’ asia 


taken throughout the ellipsoid _ ate e+e +s ‘< 1, 
¢ 


4. Prove that 


f [eer azay = ae-* — he 
R id 0 +0 


{where R denotes the half-plane x =a > 0), by applying the transfor- 
mation 


e+tyP=wt+at, y= om 


| ff met + wrdedy 


is invariant on inversion. 


5. Prove that 


6. Evaluate the integral of Ex. 4, p. 247, by using three-dimensional 
polar co-ordinates. 


7. Evaluate the integral 
r= f f feosxe + yn + 20)dEdydt 


taken throughout the sphere &? + y?+ (S11. 
8. Prove that 


f f cos (aE + yn)d&dyn = 2I,(r)/r, (n= V (a2 + y9)) 


where the integral is to be extended over the circle €? + 7? S 1 and J, 
denotes the Bessel function defined in Ex. 5, p. 223. 
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5. Improper INTEGRALS 


In the case of functions of one variable we found it necessary 
to extend the concept of integral to functions other than those 
which are continuous in a closed region of integration. In fact, 
we did consider the integrals of functions with jump discon- 
tinuities and of functions with infinite values, and also integrals 
over infinite intervals of integration. The corresponding exten- 
sions of the concept of integral for functions of several variables 
must now be discussed. 


1. Functions with Jump Discontinuities, 


For functions which have jump discontinuities in the region 
of integration R the extension of the concept of integral is 
immediate. We assume that the region of integration can be 
subdivided by a finite number of smooth arcs of curves * into 
a finite number of sub-regions R,, Ry, ..., R, in such a way that 
the integrand f is continuous in the interior of each sub-region, 
and as the boundary of such a sub-region is approached from the 
interior the values of the function tend to definite continuous 
boundary values; but the limiting values obtained as we 
approach a point on a curve separating two sub-regions may 
differ according as we approach the point from one sub-region 
or the other. The integral of the function f over the region R 
we shall then define as the sum of the integrals of the function f 
over the sub-regions R,. The integrals of f over the regions R, 
are at once given by our original definition if for each sub-region 
we suppose that the function is extended by including the boun- 
dary values, so that it becomes a continuous function in the 
closed region R,. 


As an example we consider a function f(z, y) which is defined in the 
wquare OS 251,05 y<1 by 


f(x. y)=1 for y<-a, 
f(z. y)=2 for y=. 
For this function the line y= x is a line of discontinuity, and by the 


process described we find that the improper integral f f(x, y)dady taken 
over the square has the value 3. 


* By a smooth arc of curve we mean an arc with a continucusly turning 
tangent, 
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2. Functions with Isolated Infinite Discontinuities. 

If the integrand becomes infinite at a single point P of the 
region of integration, we define the integral of the function f over 
the region R by a process analogous to that for one independent 
variable. We mark off a neighbourhood U, about the point of 
discontinuity P, so that the closed residue R,= R—U, no 
longer contains the point P. There are many possible sequences 
of neighbourhoods U, whose diameters tend to zero as v increases, 
e.g. the sequence of circles or spheres about the point P with 
radius c= 1/v. If the sequence of the integrals over the residual 
region R, tends to a limit J, ie. if 


a [ff y)dS = I, 


and if this limit is independent of the particular choice of the 
sequence R,, then its value is called the integral or, more 
accurately, the improper integral of the function f over the region 


R, and we write 
1=f f f(, yds. 


Such an integral taken over the region R is sometimes called a 
convergent integral (or is said to converge). If no limiting value I 
exists, the integral is said to be divergent (or to diverge). The 
definition of course remains valid if P is an isolated point of 
indeterminacy, such as the origin for the function sin (ata) 

If in the neighbourhood of P the absolute value of the 
function remains below a fixed bound, the integral is always 
convergent. 

The general conditions for the convergence of an integral can 
therefore be stated as follows. To every positive « there corre- 
sponds a bound 5 = &(c) for which the following condition is 
satisfied: if U and U’ are any two (open) sub-regions of R which 
contain the point of discontinuity P and whose diameter is 
smaller than §, then the integrals of the function f over the 
closed residual regions G— U and G— U’ differ in absolute 
value by less than e. We shall illustrate these ideas by means of 
a few examples. 


The function 


f(x, y) = logV2? + 8 
10 (B912) 
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becomes infinite at the origin of the zy-plane. Therefore in order to calculate 
the integral over a region R containing the origin, e.g. over the circle 
22+ y? | 1, we must cut out the origin by surrounding it with a region 
Us whose diameter is less than 5, and we must then investigate the con- 
vergence of the integral taken over the residual region R; = R— Us; 
as § +0. The neighbourhood Us, certainly lies within a circle of radius § 
about the origin. In accordance with section 4 (p. 254) we transform 
the integral to polar co-ordinates and obtain 


J log-V 22 + y*dady =f r logrdrd®, 
Bs ay 


where the integral on the right is taken over the region FR,’ of the r@-plane 
corresponding to the region Rs. In our case this is a region which contains 
the rectangle 8 Sr <1, 0 [0S 2x but does not include the straight 
line r= 0. The function 7 logr is continuous for r= 0, however, if we 


assign the value 0 to it at that point; for lim rlogr = 0. We can therefore 
r—>?0 
let 3 tend to 0 and regard the transformed integral 


pie rlogrdrd6 = lim fg r logrdrd® 
Rr 80 dry 


as an ordinary integral in the sense of section 2 (p. 224). The convergence 
of the integral is therefore established. 


At the same time this example shows that, as in the case ot 
one independent variable, a properly chosen transformation of 
co-ordinates sometimes changes an improper integral into a proper 
integral. This fact clearly shows how inadmissible a restriction 
we should lay upon ourselves if we refused to consider improper 
integrals. 

As a further example we consider the integral 


(Var + y)"” 
taken over the same region. If we first think of the integral as 


taken over the region R, obtained from RF by cutting out a 
circle with radius 6, and then transforming to polar co- 


ordinates, we obtain 
ff Anvres, 
Rs ° 


or, as a repeated integral, 
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Rg 
From Vol. I (p. 246) we know that the integral if aus is convergent 
0 
if, and only if, a<2. We therefore conclude that the double 


dx dy a at ; é 
integral is likewise convergent if, and only if, 
ie Lovers ai (Ve + ¥)" : : 
a<2. As in the preceding example, the convergence is inde- 

pendent of the particular choice of the sub-regions U;. 

This remark can readily be used to obtain a sufficient (by no 
means a necessary) criterion for the convergence of improper 
double integrals, which is applicable in many special cases, 

If in the closed region R the function {(x, y) is continuous every- 
hab except at one point P, which we take as the origin x = 0, 

= 0, and f becomes infinite at P, and if there is a fixed bound 
M and a positive number a <2 such that 


lf(@, y) |< ———- 


(Va+ 9) + + yt) 
everywhere in R, then the integral 


[ [Fle ydedy 
converges. : 


The proof is obtained from the above by considering the 
relation 


[ [fe naedy] < ff 


where B is a region not containing P and lying within a small 
circular neighbourhood of P. 
We can deal with the triple integral 


da dy 


fe, | dedy sm | | ae 


if f [Ses dx dy dz 

(Va? + y® + 22)" 
in a similar way. If R contains the origin, we introduce polar 
co-ordinates and obtain 


ff j om sin 0dr dO dd. 


A discussion similar to the preceding shows us that convergence 
occurs when a < 3. Asa general criterion we have the following: 
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The integral of a function {(x, y, z) which becomes ‘infinite at the 
origin, but is continuous at every other point of a region R containing 
the origin, is convergent, if there is a fixed bound M and a positive 
number a <3 such that the inequality 


lf (a, y, = 


Ss 
z)| = (V2 + ye 2)" 


holds everywhere in the region. 
From these criteria we conclude more generally that integrals 
of the form 


g(x, y)dxdy 
Weye=a¥ + 5 


over a two-dimensional region and integrals of the form 


y; i f g(x, y, 2) dady dz (a <3) 
(Ve — a? + (y— 68 + (@— 0)" 


over a three-dimensional region converge, where (a, 6), or (a, 8, c), 
is a fixed point in the interior of the region R and g is a continuous 
function in the closed region R. We have only to transfer this 
point to the origin by translation of the co-ordinate system and 
then to apply our criterion. 


3. Functions with Lines of Infinite Discontinuity. 


Tf a function f(z, y) becomes infinite not only at a single point 
but along whole curves C in the region R, we can proceed to 
define the integral of a function f over the region RF in an exactly 
analogous way. We cut the curve of discontinuity C out of the 
region R by enclosing it in a region U, of area less than «. If 
then as « tends to 0 the integral of the function f over the re- 
gion R— U, tends to a limit I independent of the particular 
choice of the region U,, we say that the integral of £ over the region 
R ts convergent (or converges) and we take this limiting value as 
the value of the integral. 

The simplest example is the case in which the curve C consists 
of a portion of a straight line, say a segment of the y-axis. If 
the relation 


Ife vi<%, 
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where M is a fixed bound and a is less than 1, is valid everywhere 
in the region R, then the integral over the region R converges. 
The proof is similar to the proofs of the preceding sub-section. 
For example, we may cut the y-axis out of the region by means 
of straight lines parallel to it. 


4. Infinite Regions of Integration. 


If the region R extends to infinity, we approximate to it by 

a sequence of sub-regions R,, R,,..., F,,---; which are all 
bounded and have the property that every arbitrary bounded 
sub-region of R is contained in every R, for which n is greater 
than a certain m. (If, for example, R is the whole plane, for R, 
we can choose the circular region of radius v about the origin.) 
If the limit 

lim ff fle, yas 

v—> oO By 
exists and is independent of the particular choice of the sequence 
of sub-regions R,, we call it the integral of the function f over the 
region R. 


To illustrate this statement by an example, we consider the integral 


a f 9" dedy, 
R 


where the region of integration is the whole xy-plane. In order to establish 
the convergence of this integral we first choose the sub-regions R, as the 
circles K, with radius vy, 

a+ysv; 


these obviously satisfy the above requirements. We have therefore to 
investigate the limit of the integral 


f ei eo y"dedy 
Ky 


asv—o. But we have already evaluated this integral (Vol. I. p. 496) 
and have found it to be equal to (1 — e~”). Now 

lim n(l— eo”) =x. 

v—> oo 
If we also show that not only the sequence of circles, but also every other 
sequence of sub-regions R with the properties mentioned, leads to the 
same value x, then according to our definition the number 7 will be the 


value of the improper integral. 
Let any sequence of such regions Rj, R,,... be given. By hypothesis, 
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each circle K,, is contained in the interior of R, provided v is sufficiently 
large; on the other hand, every R, is bounded and is therefore contained 
in a circle K,, of sufficiently large radius M. Since the integrand enay’ 
is positive everywhere, it follows that 


f fe e--'dedy < sf fe cPdedy < sf fe ei dedy, 
Km Ry Eu 


As m and increase, the integrals over K,, and Ky have the same limit r, 
so that the integral over R, must have the same limit; this proves that 
the integral must converge to the limit 7. 

We obtain a particularly interesting result if for the regions R, we choose 


the squares |x| Sv,|y{ Sv. The integral f f e-**-¥"dedy can then be 
Ry 
reduced to two simple integrations (cf. section 2, p. 228): 


of ¥ v 2 v 2 
f ff et dady= | eda fl e-Vdy = ( i e*de) = (ef «*dz) . 
gy -y —y ~y 0 


If we now let v tend to », we must again obtain the same limit x. Hence 
20 2 
(2 f eda) = 
0 


[ e-** da 1 44/n, 


in agreement with Vol. I, p. 496. 


or 


5. Summary and Extensions. 


Té is useful to consider the concepts of this section again from 
a single unifying point of view. Our extension of the concept of 
integral to cases in which the definitions in section 2 (p. 224) 
are not immediately applicable consists in regarding the value 
of the integral as the limiting value of a sequence of integrals 
over regions R,, which approximate to the original region of in- 
tegration F as v increases. For this purpose we regard the region 
R as open instead of closed; we assign all the points of dis- 
continuity of the function f to the boundary and consider the 
boundary as not belonging to R. We then say that the region 
is approximated to by @ sequence of regions Ry, Re,..., Ry, .. 
af all the closed regions R, lie in R and every arbitrarily chosen 
closed sub-region in the interior of R is also a sub-region of the 
region R,,, provided only that n is sufficiently large. If in particular 
the sub-regions R, are so chosen that each one contains the 
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preceding one in its interior, we say that they converge mono- 
tonically to the region R. 

For the sub-regions R, we can at once apply the original 
definition of the integral given in section 2, p. 224. We now 
say that the integral of f over the region R converges if the integral 
over R,, has a limiting value independent of the particular choice 
of the sequence of regions R,. It is useful to state specifically the 
following general facts which have been illustrated by the previous 
examples. 

(1) If the function f is nowhere negative in the region R, it is 
sufficient to show that for a single monotonic sequence R, the 
sequence of values of the integral converges, in order to ensure 
convergence to the same limit for an arbitrary sequence R,’. 

Proof. R,, being a closed region in the interior of R, is con- 
tained in all regions RF, from a certain n(v) onward. Conversely, 
every region #;, is contained in a certain R,,, for the same reason. 
Since the function is nowhere negative, it follows that 


JJ.fendady sf fi fle ydedy sf f fle, ydedy. 


As v increases the two outer bounds tend to the same limit; the 
sequence of integrals i i) (x, y)dxdy must therefore converge 
Bn 


to that limit, and our statement is proved. 

In particular, if for R, we choose a monotonic sequence of 
regions tending to FR, it follows that the function f, which is 
nowhere negative, has a convergent integral over the region R, 
provided only that the sequence of integrals over R, remains 
below a bound M. For these integrals then form a sequence of 
numbers which is monotonic non-decreasing and bounded, and 
therefore convergent. 

The case in which f is nowhere positive in R can at once be 
reduced to the preceding if we replace f by —f. 

(2) If f changes sign in the region R, we can apply the previous 
theorem to | f|. If the integral of this absolute value converges, 
it is certain that the integral of the function f itself converges. 
This is most easily proved by the following device. We put 


f=h—-Sfe 
where f,=/ if f 20, otherwise f, = 0, and f, = —f if fS0, 
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otherwise f, = 0. The two functions /,, f, are nowhere negative, 
are continuous where f is continuous, and in absolute value 
never exceed f itself. Hence, if the integral of | f | remains bounded 
for a monotonic sequence R,, the integrals of f, and /, converge, 
and with them the integral of their difference, f, — fy. 


6. GEOMETRICAL APPLICATIONS 


1. Elementary Calculation of Volumes. 


The concept of volume forms the starting-point of our 
definition of integral. It is immediately obvious, therefore, how 
we can use multiple integrals in order to calculate volumes. 


For example, in order to calculate the volume of the ellipsoid of 
revolution 


we write the equation in the form 
z= to yiat— a 2), 


The volume of the half of the ellipsoid above the zy-plane is therefore 
given by the double integral 


f= f fue ene 


taken over the circle 22 +- y* < a?. If we transform to polar co-ordinates, 
the double integral becomes 


f{ frve — r*)drd0, 
or on resolution into single integrals 
V b ee rd 6b rea 
a=if ao f raat — 18)dr = an~ fra — 1) dr, 
2 aJo 0 aJ/o 
which gives the required value, 
v= ; ma*b, 


To calculate the volume of the general ellipsoid 


“of 8 
apie? 
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we make the transformation 
z=aocos@, y= bo sind, 
aa 9) _ 
&p, 6) 


and ay half the volume obtain 


bef f(t 3—B) ety of fev — eae 


Here the region R’ is the rectangle 0 S p S$ 1,0S 6 S2n. Thus 


Veuel wet 2 Ss 
>= ef of evil— edp= 5 ° 


or 


Finally, we shall calculate the volume of the pyramid enclosed by the 
three co-ordinate planes and the plane az + by + cz — 1= 0, where we 
assume that a, 6, and ¢ are positive. For the volume we obtain 


= eh (1 — ax — by)dzedy, 


where the region of integration is the triangle 0 S x Ss . »OSys r( 1— az) 
in the zy-plane. Therefore 


1 pila p(l—ax)/b 
V=-|[d& 1 — az — by)dy. 
f f ( by) dy 


c 


Integration with respect to y ab 


(1-ax)/b — 2 
(= any — sy [= OS, 


26 
and if we integrate again by means of the substitution 1 — ax = t, we obtain 
V 2 3 ue 1 
= o,f lands = — gi, (1 — az) | = Be 


We could of course have obtained the result from the theorem of elementary 
geometry that the volume of a pyramid is one-third of the product of base 
and altitude. 


In order to calculate the volume of a more complicated solid 
we can subdivide the solid into pieces whose volumes can be 
expressed directly by double integrals. Later, however (in par- 
ticular in the next chapter), we shall obtain expressions for the 
volume bounded by a closed surface which do not involve this 
subdivision. 

10° (#912) 
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2. General Remarks on the Calculation of Volumes. Solids of 
Revolution. Volumes in Polar Co-ordinates. 


Just as we can express the area of a plane region R by the 


double integral 
[f= [ dedy, 


we may also express the volume of a three-dimensional region R 
by the integral 
= dadyd 
V f f i x dy dz 


over the region R. In fact this point of view exactly corresponds 
to our definition of integral (cf. Appendix, p. 291) and expresses 
the geometrical fact that we can find the volume of a region 
by cutting the space into identical parallelepipeds, finding the 
total volume of the parallelepipeds contained entirely in R, and 
then letting the diameter of the parallelepipeds tend to zero. 
The resolution of this integral for V into an integral i dz a i dx dy 
expresses Cavalieri’s principle, known to us from elementary 
geometry, according to which the volume of a solid is determined 
if we know the area of every plane cross-section which is per- 
pendicular to a definite line, say the z-axis. The general expression 
given above for the volume of a three-dimensional region at once 
enables us to find various formule for calculating volumes. For 
this purpose we have only to introduce new independent variables 
into the integral instead of a, y, z. 

The most important examples are given by polar co-ordinates 
and by cylindrical co-ordinates; the latter will be defined 
below. We shall calculate e.g. the volume of a solid of revolution 
obtained by rotating a curve x= ¢(z) about the z-axis. We assume 
that the rotating curve does not intersect the z-axis and that the 
solid of revolution is bounded above and below by planes z= const. 
The solid is therefore defined by inequalities of the forma Sz b 
and 0S v/(z?+ y?) S d(z). Its volume is given by the inte- 
gral above. If we now introduce the cylindrical co-ordinates 
2, pH (22+ 9"), = arc 008 = arc sin” instead of 2, Y, %, 
P 


we at once obtain the expression 
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aff [dcdyde =f df a9 f "pap 


for the volume. If we perform the single integrations, we at once 
obtain 


V= if beede 


(cf. Vol. I, Chap. V, section 2, p. 285). 

We can also obtain this expression intuitively. If we cut 
the solid of revolution into small slices z, Sz S 2,4, by planes 
perpendicular to the z-axis, and if by m, we denote the minimum 
and by M, the maximum of the distance ¢(z) from the axis in 
this slice, then the volume of the slice lies between the volumes 
of two cylinders with altitude Az = z,,, — z, and radii m, and 
M, respectively, Hence 

im rhe SV SUMP rhz. 


By the definition of the ordinary integral, therefore, 
b 
V= nf pede. 


If the region R contains the origin O of a polar co-ordinate 
system (7, 9, ¢) and if the surface is given in polar co-ordinates 


by an equation 
r= (9, >) 


where the function f(9, ¢) is single-valued, it is frequently advan- 
tageous to use these polar co-ordinates instead of (x, y, 2) in 
calculating the volume. If we substitute the value of the Jacobian 


A(z, ys 2) = 1 gin @ (as calculated on p. 254) in the transformation 


O(r, 8, d) 


formula, we at once obtain the expression 


2r 7 (8, &) 
V= rsinddrdédd =| d in 6d@ 2dr 
f i, f sin Odrdédd f ip f sin r 
for the volume. Integration with respect to r gives 
1 2n v . 
v=, i db ip (8, $) sin 0d0. 


In the special case of the sphere, in which f(0, 9) = R is constant, we 
at once obtain the value $nX? for the volume of the sphere. 


268 MULTIPLE INTEGRALS [CuaP. 


3. Area of a Curved Surface. 


We have already expressed the length of arc of a curve by 
an ordinary integral (Vol. I, p. 279). We now wish to find an 
analogous expression for the area of a curved surface by means of 
a double integral. We regard the length of a curve as the limiting 
value of the length of an inscribed polygon when the lengths of 
the individual sides tend to zero. For the measurement of areas 
a direct analogy with this measurement of length would be as 
follows: in the curved surface we inscribe a polyhedron formed 
of plane triangles, determine the area of the polyhedron, make 
the inscribed net of triangles finer by letting the length of the 
longest side tend to zero, and seek to find the limiting value of 
the area of the polyhedron. This limiting value would then be 
called the area of the curved surface. It turns out, however, 
that such a definition of area would have no precise meaning, for 
in general this process does not yield a definite limiting value. 
This phenomenon may be explained in the following way: a 
polygon inscribed in a smooth curve always has the property, 
expressed by the mean value theorem of the differential calculus, 
that the direction of the individual side of the polygon approaches 
the direction of the curve as closely as we please if the subdivision 
is fine enough. With curved surfaces the situation is quite 
different. The sides of a polyhedron inscribed in a curved surface 
may be inclined to the tangent plane to the surface at a neighbour- 
ing point as steeply as we please, even if the polyhedral faces 
have arbitrarily small diameters. The area of such a polyhedron, 
therefore, cannot by any means be regarded as an approximation 
to the area of the curved surface. In the appendix we shall 
consider an example of this state of affairs in detail (pp. 341-2). 

In the definition of the length of a smooth curve, however, 
instead of using an inscribed polygon we can equally well use a 
circumscribed polygon, that is, a polygon of which every side 
touches the curve. This definition of the length of a curve as the 
limit of the length of a circumscribed polygon can easily be 
extended to curved surfaces. The extension is even easier if we 
start from the following remark: we can obtain the length of a 
curve y= f(z) which has a continuous derivative f(z) and lies 
between the abscisse a and b by subdividing the interval between 
aand bat the points 2, 7%, . . . , Z, into n parts of equal or different 
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lengths, choosing an arbitrary point ¢, in the v-th sub-interval, 
constructing the tangent to the curve at this point, and measuring 
the length 1, of the portion of this tangent lying in the strip 


2,<c<2,,. The sum DI, then tends to the length of the 


v=1 


b 
curve, i.e. to the integral f V{1 + f'(x)*} da, if we let n increase 


beyond all bounds and at the same time let the length of the 
longest sub-interval tend to zero. This statement follows from 
the fact that J, = (2,4, —2,)/f1+S'(é,)}- 

We can now define the area of a curved surface in a similar 
way. We begin by considering a surface which lies above the 
region R of the ay-plane and is represented by a function 
z= f(a, y) with continuous derivatives. We subdivide R into 
n sub-regions R,, R,,..., Ry» with the areas AR,,..., AR, 
and in these sub-regions we choose points (£, 1), -- - » (ns Mn): 
At the point of the surface with the co-ordinates é, 7, and 
t,=f(é, 7,) we construct the tangent plane and find the area 
of the portion of this plane lying above the region R,. If a, is 
the angle which the tangent plane 


z— f = fl é,, ,) («— é) +H, ny) y= ny) 


makes with the ay-plane, and if Az, is the area of the portion 7, 
of the tangent plane above R,, then the region R, is the projection 
of 7, on the zy-plane, so that 


AR, = Az, cosa, 


Again (cf. Chap. ITI, section 2, p. 130), 
1 


V1 + felEn m) + hrs) 
and therefore 


Ar, = 1 + feXE,, Nv) + fHE, Nv) 4 AR,. 


If we now form the sum of all these areas 


cosa, = 


= Az, 

v=1 
and let increase beyond all bounds, at the same time letting 
the diameter (and consequently the area) of the largest sub- 
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division tend to zero, then according to our definition of integral 
this sum will have the limit 


A=f [V+ fE+ fis. 


This integral, which is independent of the mode of subdivision 
of the region R, we shall define as the area of the given surface. 
If the surface happens to be a plane surface, this definition agrees 
with the preceding; for example, if z= f(x, y) = 0, we have 


A=| fas. 


It is occasionally convenient to call the symbol 


do= Vi +fe+ f2d8 = V1+f2-+ frdady 


the element of area of the surface z= f(x, y). The area integral 
can then be written symbolically in the form 


KC 


We arrive at another form of the expression for the area if 
we think of the surface as given by an equation ¢(x, y, z) = 0 
instead of z= f(x, y). If we assume that on the surface ¢, + 0, 
say ¢, > 0, then the equations 


Oz _ be Oz _ dy 


at once give the expression 
1 
J [VEFF SEF GP 5 dey 


for the area, the region R again being the projection of the 
surface on the zy-plane. 
As an example of the application of the area formula we consider the 


area of a spherical surface. The equation z= V(R? — 2? — y*) represents 
a hemisphere of radius R. We have 


dz a dz y 


de (RHP)! dy 4/(R—— yy 
The area of the hemisphere is therefore given by the integral 


aR f (7a =o 
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where the region of integration R’ is the circle of radius RF lying in the 
zy-plane and having the origin as its centre. By introducing polar co- 
ordinates and resolving the integral into single integrals we further obtain 


or fh rdr z rdr 
A= Rf af ee = 2k | a 
3 0 Jo Vik — ) o V (BR? — 1) 
The ordinary integral on the right can easily be evaluated by means of 
the substitution R? — r? = u; we have 
R 


4A = ~—2nRV/R?— 2 - 2nR, 


in agreement with the fact, known from elementary geometry, that the 
area of the surface of a sphere is 4nR?. 


In the definition of area we have hitherto singled out the co- 
ordinate z. If, however, the surface had been given by an equation 
of the form z = a(y, 2) or y = y(2, z), we could equally well have 
represented the area by integrals of the form 


[f[vataP+ e2dyde or f f¥A+yP+ y2)dede, 
or, if the surface were given implicitly, we should have 
[[Vb2+ $84 42) 5 des 


or 


[[VG2+ bb + 62) 5 dude 


That all these expressions do actually define the same area 
is self-evident. The equality of the different expressions can, 
however, be verified directly. For example, we apply the trans- 
formation 

t= xy ? 2), 
y=y 


ff Vibe ae + $0) aridy. 


to the integral 


Here x = 2(y, 2) is found by solving the equation 4(z, y, z) = 0 


pu 1 Pe and therefore 
oy, 2) be 


for x. The Jacobian is 
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V(b? + $y*+ 2%) — f (Vibe + bP + $27) 
1D mamma ea dady =ff 7 dy dz. 


The integral on the right is to be taken over the projection R’ 
of the surface on the yz-plane. 

If in expressing the area of a surface we wish to get rid of any 
special assumption about the position of the surface relative to 
the co-ordinate system, we must represent the surface in the 
parametric form 


t= d(u, ”), y= Pu, v), 2= x (u, v). 


A definite region R’ of the uv-plane then corresponds to the sur- 
face. In order to introduce the parameters u and v in the 
above formule we first consider a portion of the surface and 


assume that for this portion the Jacobian iad = D is every- 
where positive. According to Chap. III, section 3, p. 153, for this 
portion we can then solve for u and v as functions of a and y, 
obtaining 


for the partial derivatives. 
In virtue of the equations 


Oz __ oO 
ox (Ou 
we obtain the expression 


lt+ (a) +) 


= 5 V{(dutto— typ)? + (huxe a XuPo)?+ (xuPv oo duxe)?}- 


te + £09 and dy ut Oe 


If we now introduce u and v as new independent variables and 
apply the rules for the transformation of double integrals (p. 253), 
we find that the area of the portion of the surface corresponding 
to R’ is 
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A= f [Vue dube) + (uxe— Xutho)®-+(Xubo— Pure)" dude. 


In this expression there is no longer any distinction between the 
co-ordinates z, y, and z. Since we arrive at the same integral 
expression for the area no matter which one of the special non- 
parametric representations we start with, it follows that all 
these expressions are equal and represent the area. 

So far we have only considered a portion of the surface on 
which one particular Jacobian does not vanish. We reach the 
same result, however, no matter which of the three Jacobians 
does not vanish. If then we suppose that at each point of 
the surface one of the Jacobians is not zero, we can subdivide 
the whole surface into portions like the above, and thus find that 
the preceding integral still gives the area of the whole surface: 


A= if i V { (buh tube)? + (huxe— Xutlr)?e+ ( Xuvo— duxe)*}du dv. 


The expression for the area of a surface in parametric re- 
presentation can be put in another noteworthy form if we make 
use of the coefficients of the line element (cf. Chap. III, section 4, 


p. 163) ds? = Edu? + 2Fdudv + Gde*, 


that is, of the expressions 
E= db? + 2 + Xu, 
F= dubo + Duty + XuXes 
G= gy" + 2? + xe”. 

A simple calculation shows that 


EG— P= (bufio— tude)” + (huxXe— Xutbo)? + (Xupo— PuXv)* 


Thus for the area we obtain the expression 
f f (EG — F*)dudv, 


and for the element of area 
do == »/(EG — F?)dudo. 


As an example we again consider the area of a sphere with radius R, 
which we now represent parametrically by the equations 
a= Reosusinye, 
y = Rsinu sin», 
z= Recosv, 
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where u and v range over the region 0 Su XS 2nand0 SvSn. Asimple 
calculation once more gives us the expression 


Qa od 
mf auf anode = anhe 
0 0 


for the area. 


In particular, we can apply our result to the surface of revolu- 
tion formed by rotating the curve z= ¢(x) about the z-axis. 
If we refer the surface to polar co-ordinates (u, v) in the ay-plane 
as parameters, we obtain 


r= ucosy, y=usiny, 2= g(V22+ y?) = J(u). 
Then 
E=14¢%), F=0, Gaus, 


and the area is given in the form 
ar ty tty ey 
f dof uv 1+ ¢'2(u)du = an f uV1-+ $’2%(u)du 
0 Uo Uy 


If instead of u we introduce the length of arc s of the meridian 
curve z= ¢(u) as parameter, we obtain the area of the surface 
of revolution in the form 

Qa if uds, 


where u is the distance from the axis of the point on the 
rotating curve corresponding to s (Guldin’s rule; cf. Vol. I, 
p. 285). 

As an example we calculate the surface area of the torus or anchor ring (cf. 


Chap. ITI, section 4, p. 165) obtained by rotating the circle (y— a)? + 2% = r? 
about the z-axis. If we introduce the length of arc s of the circle as a 


parameter we havew=a-+r cos, and the area is therefore 


207 2rr 
anf uds = anf (a+ r cos- *) as = 2na. 2xr. 


The area of an anchor ring is therefore equal to the product of the cireum- 
ference of the generating circle and the length of the path described by the 
centre of the circle. 
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EXAMPLES 
1. Calculate the volume of the solid defined by 
{Vv (et + y*) — 1}? 


a? 


4% <1 <)}) 
= ; 


2. Find the volume cut off from the paraboloid 


by the plane z = h. 
3. Find the volume cut off from the ellipsoid 


by the plane 
te + my + nz = p. 


4. (a) Show that if any closed curve 6 = f(¢) is drawn on the surface 
7? = a* cos 20 


(r, 9, » being polar co-ordinates in space), the area of the surface so enclosed 
is equal to the area enclosed by the projection of the curve on the sphere 
r = a, the origin of co-ordinates being the vertex of projection. 

(6) Express the area by a simple integral. 

(c) Find the area of the whole surface. 

5. Find the area of the surface of the spheroid formed by rotating an 
ellipse about its major axis, and show that if the fourth and higher powers 
of the eccentricity e may be neglected, this area is equal to that of the 
sphere whose volume is equal to that of the spheroid. 

6. Find the volume and surface area of the solid generated by rotating 
the triangle ABC about the side AB. 


7*, A tube-surface is generated by the spheres of unit radius whose 
centres form the closed plane curve Z. Prove that the area A of the 
surface is 27 times the length of L. 


8*, (a) Calculate the volume of the region defined by 
aty+2e2 sn 
e+y— rz 20 
e+y+tre 20. 


(b) Calculate the area of the spherical part of the boundary of this 
region, i.e. the area of the surface 


at y+ 2 = r2 
2@4+y-— rz 20 
@+y+r2z 20. 
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9. Calculate the area of that part of the sorew surface 
z 
—atan~ = 
y¥—% an 5 0 
for which 


r<otys Rk, lel Soh 


10. Calculate the area of the surface 
(at oP + at) at — yA 


7. Poysica, APPLICATIONS 


In section 2, No. 7 (p. 235) we have already seen how the 
concept of mass is connected with that of a multiple integral. 
Here we shall study some of the other concepts of mechanics. 
We begin with a more detailed study of moment and of moment 
of inertia than was possible in Vol. I, Chap. X (p. 496). 


1. Moments and Centre of Mass. 


The moment with respect to the xy-plane of a particle with mass 
m is defined as the product mz of the mass and the z-co-ordinate. 
Similarly, the moment with respect to the yz-plane is mz and that 
with respect to the zx-plane is my. The moments of several particles 
combine additwely, that is, the three moments of a system of 
particles with masses m,, M9, ..., Mm, and co-ordinates (2,, ¥,, %); 
+++ (ns Yn; 2n) are given by the expressions 


n n nm 
Te=imge, Ty=imy, T,=Ximz,. 
v=1 v=1 ped 

If instead of a finite number of particles we are dealing with 
a mass distributed continuously with density p= p(s, y, 2) 
through a region in space or over a surface or curve, we define 
the moment of the mass-distribution by a limiting process, as in 
Vol. I, Chap. X, section 6 (p. 497), and thus express the moments 
by integrals. For example, with a distribution in space we sub- 
divide the region R into n sub-regions, imagine the total mass 
of each sub-region concentrated at any one of its points, and 
then form the moment of the system of these » particles. We 
see at once that as n->0 and at the same time the greatest 
diameter of the sub-regions tends to zero the sums tend to the 
limits 
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T.=f f { uedndyde, T, =f ff ydadyde, 
T, =f ff wededy de, 
Rg 


which we call the moments of the volume-distribution. 

Similarly, if the mass is distributed over a surface § given 
by the equations «= d(u, v), y= ¥(u, v), z= x(u, v) with 
surface density p(u, v), we define the moments of the surface dis- 
tribution by the expressions 


T= [uodo=f f poVEG—Fedude, 
Z, =f fuydo=f [ vyVEG— Fedudo, 


T, =f fuedo=f f pe EG — Fedudn. 


Finally, the moments of a curve x(s), y(s), 2(s) in space with mass 
density p(s) are defined by the expressions 


= f ‘ueds, Ty = ‘uyds, T,— f peds, 


where s denotes the length of arc. 
The centroid (centre of mass) of a mass of total amount M 

distributed through a region # is defined as the point with co- 

ordinates 

Me T, 


f= 1 9 


Lr, 


For a distribution in space the co-ordinates of the centre of mass 
are therefore given by the expressions 


é= it S [vmdedyde, &c., where M=f f { wdzdyde 


As an example we first consider the uniform hemispherical region H 
with mass density 1: 


e@+y+esi, 


z = 0. 
The first two moments 


Te =| ff xdvdyae, 
aA 
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Le =f ff vacayae 


are zero, since the integration with respect to x or with respect to y gives 
the value zero. For the third, 


T. = ff [ edndyae, 


we introduce cylindrical co-ordinates (r, z, 8) by means of the equations 


2=2, 
a =r cos, 
y=rasind 


and obtain 
V(1— 2%) 1] 22 (e ae n 
rd fa O= anf 7) 2dZ= % ime rar 
Since the total mass is 27/8, the co-ordinates of the centre of mass are 
z= 0,y=0,2= 3. 
We shall next calculate the centre of mass of a hemispherical surface 


of unit radius over which a mass of unit density is uniformly distributed. 
For the parametric representation 


z=cosusinv, y=sinusiny, z= cosy 


we calculate the surface element from the formula on p. 273 and 
find that 
V EG — Fidudv = sinv du dv 


We accordingly obtain 


T= sintode f' cosudu = 0, 


{2 
= [sin ode fs sinudu = 0, 


‘I 


a] 2. By wf 
T, =/ sino cosvdo f' du = 2x = =, =7 
0 


for the three moments. Since the total mass is obviously 27, we see that 
the centre of mass lies at the point with co-ordinates = 0, y = 0, z= 3. 


2. Moment of Inertia. 


The generalization of the concept of moment of inertia is 
equally obvious. The moment of inertia of a particle with respect 
to the x-axis is the product of its mass and p* = y? +23, that is, 
the square of the distance of the point from the z-axis. In the 
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same way, we define the moment of inertia about the z-axis of 
a mass distributed with density p(x, y, 2) through a region R 
by the expression 


Sf Loge + P)dadyde 


The moments of inertia about the other axes are represented by 
similar expressions. Occasionally the moment of inertia with 
respect to a point, say the origin, is defined by the expression 


Sf fuer y? + 2) dadydz, 


and the moment of inertia with respect to a plane, say the 


y2-plane, by 
ff ff vardedy de. 
Rk 


Similarly, the moment of inertia, with respect to the x-axis, of 
a surface distribution is given by 


J [ug + 2)do, 


where p(w, ) is a continuous function of two parameters u and v. 

The moment of inertia of a mass distributed with density 
u(x, y, z) through a region R, with respect to an axis parallel to 
the z-axis and passing through the point (€, 7, ¢), is given by the 
expression 


ine ji Hl(y — 9)? + @— $))dedydz. 


If in particular we let (€, 7, ¢) be the centre of mass (cf. p. 277) 
and recall the relations for the co-ordinates of the centre of mass 
(given on p. 277), we at once obtain the equation 


Jf fugr+ adadyde=f f fully—n) + @— OAdedyde 
+ (p+ O)f ff wdedyde, 


Since any arbitrary axis of rotation of a body can be chosen as 
the z-axis, the meaning of this equation can be expressed as 
follows: 

The moment of inertia of a rigid body with respect to an arbitrary 
awis of rotation is equal to the moment of inertia of the body about 
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@ parallel awis through its centre of mass plus the product of the total 
mass and the square of the distance between the centre of mass and 
the axis of rotation (Steiner’s theorem). 

The physical meaning of the moment of inertia for regions in 
several dimensions is exactly the same as that already stated 
in Vol. I, Chap. V, section 2 (p. 286): 

The kinetic energy of a body rotating uniformly about an axis 
ts equal to half the product of the square of the angular velocity and 
the moment of inertia. 


The following examples may serve to illustrate the concept and the 
actual calculation of the moment of inertia in simple cases. 

For the sphere V with centre at the origin, unit radius and unit density, 
we see by symmetry that the moment of inertia with respect to any axis 
through the origin is 


raf f foes wdedyde = ff fe + #)dudydz 
=f f fo + #)dxdy dz, 


If we add the three integrals, we obtain 
3I =f f fow+ y? + 2) dxdydz, 
v 


or, if we introduce polar co-ordinates, 


2 pl ; 2 9 8x 
= 3 rar ['sinodo [au = 5. 5.2.20 =F 


For a beam with edges a, b, c, parallel to the z-axis, the y-axis, and 
the z-axis respectively, with unit density and centre of mass at the origin 
we find that the moment of inertia with respect to the zy-plane is 


al2 bl /2 3 
f def a i Ade ab”. 
—aj2 J—1j2 Jee 12 
3. The Compound Pendulum. 


The above ideas find an application in the mathematical treatment of 
the compound pendulum, that is, of a rigid body which oscillates about a 
fixed axis under the influence of gravity. 

We consider a plane through @, the centre of mass of the rigid body, 
perpendicular to the axis of rotation; let this plane cut the axis in the 
point O (fig. 14). Then the motion of the body is obviously given if we 
state the angle @ = 9(t) which OG makes at time ¢ with the downward 
vertical line through O. In order to determine this function 9(t) and also 
the period of oscillation of the pendulum, we require to assume a know- 
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ledge of certain physical facts (cf. Chap. VI, section 1, p. 412), We make 
use of the law of conservation of energy, which states that during the 
motion of the body the sum of its kinetic and potential energies remains 
constant. Here V, the potential energy of the body, is the product Mgh, 
where M is the total mass, g the gravitational 
acceleration, and A the height of the centre of 
mass above an arbitrary horizontal line, e.g. 
above the horizontal line through the lowest 
position reached by the centre of mass during 
the motion. If we denote OG, the distance of 
the centre of mass from the axis, by s, then 
V = Mgs(1—cosp). By p. 280 the kinetic 
energy is given by 7’ = 419%, where J is the 
moment of inertia of the body with respect to 
the axis of rotation and we have written 9 for 
dq/dt. The law of conservation of energy there- 
fore gives the equation 


‘ 
We a ee 


41¢? — Mgs cos@ = const. Fig. 5 sealed Soenpesind 
pe 


If we introduce the constant 1! = I/Ma, this is 
exactly the same as the equation previously found (Vol. I, Chap. V, 
p. 302) for the simple pendulum; / is accordingly known as the length of the 
equivalent simple pendulum. 

We can now directly apply the formule previously obtained (Joc, cit.). 
The period of oscillation is given by the formula 


a be do 
pase [* 2 
g —,V 008 — COB Po 


where 9, corresponds to the greatest displacement of the centre of mass; 
for small angles this is approximately 


t Ape 
P= on a/2 = 2m Vigs’ 


The formula for the simple pendulum is of course included in this as a 
special case. For if the whole mass M is concentrated at the centre of mass, 
then J = Ms?, so that 1 = «. 

Investigating further, we recall that I, the moment of inertia about 
the axis of rotation, is connected with J,, the moment of inertia about a 
parallel axis through the centre of mass, by the relation (cf. p. 279) 

L=I1,+ Ms'. 
Hence 


I 
1 = 29. 
6+ U;’ 
or, if we introduce the constant a = I,/M, 


l=e+ 2, 
8 
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We see at once that in a compound pendulum ?/ always exceeds s, 
so that the period of a compound pendulum is always greater than that 
of the simple pendulum obtained by concentrating the mass UM at the 
centre of mass. Moreover, we note that the period is the same for all 
parallel axes at the same distance s from the centre of mass. For the 
length of the equivalent simple pendulum depends only on the two quan- 
tities s and a = J,/M, and therefore remains the same provided neither 
the direction of the axis of rotation nor its distance from the centre of 
mass is altered. 

If in the formula / = s+ a/s we replace the quantity s by a/s, that 
is, if the axis is moved from the distance s to the distance a/s from the 
centre of mass, then / remains unchanged. This means that a compound 
pendulum has the same period of oscillation for all parallel axes which 
have the distance s or a/s from the centre of mass. 


The formula 7 = 27x ar (Ge) shows at once that the period 7 
g 


increases beyond all bounds as s tends to zero or to infinity. It must 
therefore have a minimum for some value sg. By differentiating we obtain 


= —. flo 
&=Va= no 


A pendulum whose axis is at a distance 8) =+/I,/M from the centre 
of mass will be relatively insensitive to small displacements of the axis. 
For in this case dT /ds vanishes, so that first-order changes in s produce 
only second-order changes in JT. This fact has been applied by Prof. 
Schuler of Géttingen in the construction of very accurate clocks. 


4. Potential of Attracting Masses. 


We have seen in Chap. II, section 7 (p. 90) that according to Newton’s 
jaw of gravitation the force which a fixed particle Q with co-ordinates 
(& », ¢) and mass m exerts on a second particle P with co-ordinates (x, y, 2) 
and unit mass is given, apart from the gravitational constant y, by 


1 
m grad ~, 


where r = a/ (x — &)? + (y— ny)? + (2 — %)* is the distance between the 
points P and @. The direction of the force is along the line joining the 
two particles, and its magnitude is inversely proportional to the square 
of the distance. If we now consider the force exerted on P by a number 
of points Q,, G2, .... Q, with respective masses ™m,, Mg, «++ Mgy We can 
express the total force as the gradient of the quantity 


My 


m. ™ 
4+ ?7+...4—%, 
ry T. Tn 


where 7, denotes the distance of the point Q, from the point P. If a force 
can be expressed as a gradient of a function, it is customary to call this 
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function the potential of the force; we accordingly define the gravitational 
potential of the system of particles Q,, Q, «++» Q, at the point P as the 
expression 

n Mm, 


» 
vei (z — &)? + (y— a? + (2 — &)* 


We now suppose that instead of being concentrated at a finite number 
of points the gravitating masses are distributed with continuous density p. 
over a portion R of space or a surface S or a curve C. Then the potential 
of this mags-distribution at a point with co-ordinates (x, y, z) outside the 
system of masses is defined as 


ff (ae S) aeandt, 
J fie 
[ee 


In the first case the integration is taken throughout the region R with 
rectangular co-ordinates (&, y, ¢), in the second case over the surface S 
with the element of surface do, and in the third case along the curve with 
length of arc s. In all three formule r denotes the distance of the point 
P from the point (&, , ¢) of the region of integration and p the mass 
density at the point (&, y, ¢). 

Thus e.g. the potential at a point P with co-ordinates (x, y, z), due to 
a sphere K of constant density equal to unity, with unit radius and centre 
the origin, is given by the integral 


Nivea Se 
nV (e — E+ (y — a)? + @ — SP 
+1 +V (12) pt V (1-899 1 
= f dé f dy if = dt 
-1 J-va-# J-va-#-9) 7 

In all these expressions the co-ordinates (x, y, 2) of the point P appear, 
not as variables of integration, but as parameters, and the potentials are 
functions of these parameters. 

To obtain the components of the force from the potential we have tc 
differentiate the integral with respect to the parameters. The rules for 
differentiation with respect to a parameter extend directly to multiple 
integrals, and by section I (p. 218) the differentiation can be performed 
under the integral sign, provided that the point P does not belong to the 
region of integration, that is, provided that we are certain that there is 


no point of the closed region of integration for which the distance r has 
the value zero. Thus, for example, we find that the components of the 


or 


or 
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gravitational force on unit mass due to a mass distributed with unit 
density through a region R in space are given by the expressions 


Raf ff FR acanas, Fa— ff fs aeanae 
ra—f ff aanar, 


Finally, we point out that the expressions for the potential and its 
first derivatives continue to have a meaning if the point P lies in the 
interior of the region of integration. The integrals are then improper 
integrals, and, as is easily shown, their convergence follows from the 
criteria of section 5 (p. 257). 

As an example we shall caloulate the potential at an internal point 
and at an external point, due to a spherical surface § with radius a and 
unit surface density. If we take the centre of the sphere as origin and 
make the z-axis pass through the point P (inside or outside the sphere), 
the point P will have the co-ordinates (x, 0, 0), and the potential will be 


dco 
bed BY re ee et 


If we introduce polar co-ordinates on the sphere by means of the equations 


— = acos, 
7 = asin 6 cos, 
¢= asinO@ sing, 


2 gt ytd 
U= [5 —— a f de 
0 4/(% — cos 6)? + a? sin? 6 0 
_ anf a* sin 8 a6 
0 Vat+a?—2axcos8 


If we put a* + a? — 2az cos 0 = r4, so that ax sin 0d0 = rdr, then (provided 
that x - 0) the integral becomes 


then 


Qua plxtal rdr na 
= — — =-—(|#x#+a|/—|ze—-a]). 
xz [x—a| r £ 


For | «| > @ we therefore have 


ot 
[= 

and for|z| <a 
U = 4na. 


Hence the potential at an external point is the same as if the whole 
mass 4xa" were concentrated at the centre of the sphere. On the other 
hand, throughout the interior the potential is constant. At the surface 
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of the sphere the potential is continuous; the expression for U is still 
defined (as an improper integral) and has the value 41a. The component 
of force F,, in the z-direction, however, has a jump of amount —47 at the 
surface of the sphere, for if | «| > 4, we have 


4na? 
p=) v 


Py — 


while F, = 0 if |x| <a. 

The potential of a solid sphere of unit density is found from the 
above by multiplying by da and then integrating with respect toa. This 
gives the value 

4na® 

3| «| 
for the potential at an external point, which is again the same as if the 
total mass $xa? were concentrated at the centre. 


EXAMPLES 


1. Find the position of the centre of mass of the curved surface of a 
right cone. 

2. Find the co-ordinates of the centre of mass of the portion of the 
paraboloid 

2+ y= pe 
cut off by the plane z = 2. 

3*, A tube-surface is generated by a family of spheres of unit radius 
with their centres in the ay-plane. Let S be a portion of the surface 
lying above the zy-plane and II the area of the projection of S on the xy- 
plane. Prove that the z-co-ordinate of the centre of mass of J is equal 
to IL/s. 

4. Calculate the moment of inertia of the solid enclosed between the 
two cylinders 


e+yP=R and @+y=—R (R>R’) 


and the two planes z = h and z = —h, with respect to (a) the z-axis, (b) the 
%-axis. 

5. If A, B, C denote the moments of inertia of an arbitrary solid of 
positive density with respect to the x-, y-, z-axes, then the ** triangle in- 
equalities ” 

A+B>0, A+OC>B, B+CO>A 
are satisfied. 

6. Find the moment of inertia of the ellipsoid 

2 y  &@ 

vay 7% we] 

a * B “ 3 
with respect to 


(a) the z-axis, 
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(6) an arbitrary axis through the origin, given by 
@iy:2z=a:B:y (a+ B?+4 y2=— 1). 
7*, Find the envelopes of the planes with respect to which the ellipsoid 


2, ya 
—_— la —_=] 
a’ pta 


has the same moment of inertia h. 


8. Let O be an arbitrary point and S an arbitrary body. On every ray 
from O we take the point at the distance 1//I from O, where IJ denotes 
the moment of inertia of S with respect to the straight line coinciding with 
the ray. Prove that the points so constructed form an ellipsoid (the so- 
called momental ellipsoid). 


9. Find the momental ellipsoid of the ellipsoid 
x y? 22 
atetan! 
at the point (&, n, ¢). 


10. Find the co-ordinates of the centre of mass of the surface of the 
sphere a4 + y* + 2? = 1, the density being given by 


1 


11. Find the x-co-ordinate of the centre of mass of the octant of the 
ellipsoid 


_=1, z20,¥20,z220. 


12. A system of masses § consists of two parts 8, and S,; I,, Ip, I are 
the respective moments of inertia of S,, S,, 8 about three parallel axes 
passing through the respective centres of mass. Prove that 


MM, 
Mm, + m, 


T=h4+1,4+ @, 


where m, and m, are the masses of S, and S, and d the distance between 
the axes passing through their centres of mass. 


13. Calculate the potential of the ellipsoid of revolution 
a+ y! 


az 


22 
at its centre (6 > a). 
14. Calculate the potential of a solid of revolution 


r=V@e+y)Sfe), aSzSs, 
at the origin. 
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Appendix to Chapter IV 
1, Tae Existence oF THE MULTIPLE INTEGRAL 


1. The Content of Plane Regions and Regions of Higher 
Dimensions. 


In order to obtain the analytical proof of the existence of the 
multiple integral of a continuous function, we must begin with 
a study of the idea of content. 

In Vol. I, Chap. V (p. 269) we saw how the content of a plane 
region can in general be expressed by an integral. Without 
making use of that fact, and without considering the existence 
of the area as guaranteed by intuition, we shall now proceed to 
give a general definition of the idea of ‘“‘ content ” and investigate 
under what conditions this concept has a meaning. 

We begin with a rectangle with sides parallel to the z- and 
y-axes, and define the area of such a rectangle as the product of 
the base and the altitude. If the given rectangle is subdivided 
into smaller rectangles by a number of parallels to the sides, it 
is clear from this definition that the area of the rectangle is equal 
to the sum of the areas of all the sub-rectangles. The area of a 
region which is composed of a finite number of rectangles * can 
now be defined as the sum of the areas of these rectangles. 

The area thus defined is independent of the way in which the 
region is subdivided (or resolved) into rectangles. For if we are 
given two different resolutions, we can find a third resolution 
which is a finer subdivision of the two original ones. We do 
this by prolonging throughout the region all the lines which 
occur in either of the resolutions. These lines subdivide the two 
subdivisions into still smaller rectangles. The sum of the areas 
of these small rectangles is equal to the sum of the areas of the 
rectangles both of the first resolution and of the second resolution. 

Now in order to define the area of an arbitrary bounded region 
B we form an inner approximation and an outer approximation 
to the region, that is, we find two regions B, and B,, each con- 
sisting of rectangles, the region B, being entirely within B and 


* Throughout this section the word rectangle will always be understood 
to mean a rectangle with sides parallel to the axes. 
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the exterior region B, containing B. For this purpose we first 
enclose the region B in a large square. Then we divide this 
square into small rectangles by drawing parallels to the axes. 
Those rectangles having points in common with B together form 
a region B, which encloses B; those rectangles which lie wholly 
within B form a region B, which is contained in B. 

We now wish to define the area O(B) of B in such a way that 
for every choice of B; and B, the area of B lies between that of 
B, and that of B,: 


O(B,) S C(B) S C(B,). 


If we make the subdivisions finer, so that the diameters of the 
rectangles tend to zero, then the areas O(B;) form a mono- 
tonic increasing sequence and the areas C(B,) form a mono- 
tonic decreasing sequence. For to the regions B, rectangles can 
only be added, and from B, rectangles can only be removed. 
Therefore C(B,) has a limit and so has O(B,). If these two limits 
are equal, we call this common limit the area of the region B. 

Under what conditions are the two limits, C(B,) and O(B,), 
equal? Of course the answer is, when the difference C(B,) — C(B;) 
tends to zero as the fineness of the subdivisions increases. The 
region B, — B, consists of those rectangles which have points in 
common with the boundary of B. Therefore if the area of this 
region B, — B, tends to zero, it follows that the boundary of B 
can be enclosed in a region composed of rectangles and having 
as small an area as we please, namely in B, — B;. Conversely, if 
the boundary of B can be enclosed in the interior of a region S 
consisting of rectangles with a total area as small as we please, and 
if the subdivision is sufficiently fine, the rectangles B, — B, will 
all lie in S; the area of B, — B, will then be lese than that of S, 
so that it tends to zero. 

The result is as follows: the limits of C(B;) and C(B,) are equal 
of, and only if, the boundary of B can be enclosed in a region consist- 
ing of rectangles of total area as small as we please. In this case 
our definition actually does assign a content * to B. 


* From the geometrical point of view it is somewhat unsatisfactory that 
in defining the content we have singled out a particular co-ordinate system. 
As a matter of fact, however, there is no difficulty in showing that the content 
is independent of the co-ordinate system, not only for two dimensions but also 
for n dimensions. We shall, however, omit this discussion here. For, on 
the one hand, it is not necessary for our particular purpose, which is the 
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In the next sub-section (p. 291) we shall prove the intuitively 
plausible fact that every sectionally smooth continuous curve (that 
is, every continuous curve which has a continuously turning 
tangent except at a finite number of points) can be enclosed in a 
region formed from rectangles, whose area is as small as we please. 
The condition is therefore satisfied whenever the region B con- 
sists of a finite number of parts, each bounded by a finite number 
of sectionally smooth curves. Such regions have a unique area; 
others do not arise in practical applications. 

We shall show on p. 292 that if a region B is subdivided by 
sectionally smooth curves the sum of the contents of the sub- 
regions is equal to the content of the whole region B. Here we 
shall merely show that the present definition of area agrees with 
the integral formule obtained previously. 


xe 
Fig. 15.—Approximation to a region by sets of rectangles 


We begin by considering a region B bounded by the z-axis, 
the lines z = a, x = 6, and a curve y= f(x). For the regions B, 
and B,, respectively contained in and containing B, we can take 
the regions composed of rectangles shown in fig. 15 (the one by 
dotted lines and the other by continuous lines). According to 
the definition of a simple integral in Vol. I, Chap. IT, section 1 
(p. 78), the areas B; and B, are respectively an upper sum F,, 


and a lower sum F,, for the integral { ydz. In addition to our 
formula as . 
C(B,) S C(B) S CB.) 


proof of the existence of the double integral; and, on the other hand, the 
fact that the content is independent of the co-ordinate system follows imme- 
diately when we represent the content by a multiple integral and recall that 
the transformation formula shows that the value of this integral is unchanged 
when new rectangular co-ordinates are introduced. 

1 (8912) 
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we accordingly have the further inequality 


b 
OB.) Sf fe)dx < O1B.), 
by the definition of integral. Since lim C(B;) = lim C(B,), it 


b 
follows that C(B) = f Jf (z)dx, in agreement with what we have 
said already. 2 

In the case of an arbitrary region B, subdivision of the 
region by lines parallel to the axes shows that our definition 
of content agrees with the expression for the area: 


f f da dy. 


The present definition of the area can immediately be extended 
to three-dimensional regions, and in fact to regions in dimensions. 
The content of a parallelepiped with sides parallel to the axes is 
defined as the product of the lengths of the three sides. We then 
extend the definition to regions composed of a finite number of 
such parallelepipeds. For an arbitrary region B we then find 
regions B, composed of parallelepipeds and lying in B and similar 
regions B, containing B. The definition of the content of the region 
Bas the common limit of the content of B, and that of B,; again 
has a meaning, provided that the boundary of the region B can 
be enclosed in a set of parallelepipeds of arbitrarily small total 
content. In the next sub-section (p. 292) we shall show that this 
can always be done for regions bounded by surfaces having 
sectionally continuous tangent planes. As before, we shall hence- 
forth restrict ourselves to such regions. The word region is always 
to mean a bounded closed region whose boundary consists of a 
finite number of surfaces with sectionally continuous derivatives. 

The volume of a cylinder with its axis in the direction of the 
2-axis and its base in the zy-plane is the product of the area of the 
base and the altitude. This is at once clear when the base is 
composed of rectangles with sides parallel to the axes. In the 
general case the cylinder can be enclosed between two cylinders 
whose bases are regions composed of rectangles and whose 
volumes differ from that of the given cylinder by arbitrarily 
small amounts. The theorem therefore holds for cylinders with 
any base. From this it follows as before that the double integral 


[[ fe yazdy 


eS 
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gives the volume of a portion of space bounded above by the 
surface z = f(x, y), below by the plane region B, and at the sides 
by the vertical lines by which the edge of the surface is projected 
into the boundary of B. Further, we see that the definition of 
volume for a general region in space R agrees with the integral 


expression 
f f f dady dz. 


2. A Theorem on Smooth Arcs. 


In discussing areas we used the theorem that a continuous 
curve with a continuously turning tangent at all but a finite 
number of points can always be enclosed in a region composed 
of rectangles with sides parallel to the axes and having an arbi- 
trarily small total content. It is obviously sufficient to prove the 
theorem for the individual arcs with continuous tangents. Let 
such an arc be given by the equations 


c= 40) <3 
y= He) =" = 


where the parameter s is the length of arc and ¢(s) and f(s) are 
continuously differentiable functions. Then 


| ¢(s)| 31, 
[w(e)| <1. 


By the mean value theorem of the differential calculus, for any 
two values s and s, of s in the interval a < s < b we have 


|e— a |= | As) — 4(s,)|S|s—s, |, 
ly-“l= | #(s) — W(s,) |S] s— |. 
If, therefore, we subdivide the curve into ” arcs of length 


« = (6—a)/n and denote the initial point of the v-th are by 
(z,, y,) and an arbitrary point of that arc by (x, y), we have 


|ja—a2,|Se or &—eSersute 
ly—y,|Se or y,-eSySyte 


The points of the v-th arc therefore all lie in a square with side 
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2e and area 4e2. The whole curve is included in n such squares, 
whose total area is at most 


4e’n = 4e(b — a). 
This quantity can be made as small as we please by taking « 
sufficiently small. 


There is no difficulty in proving the corresponding theorem 
for surfaces in space defined by the equations 


== f(u, v) 
y = (u, 2) 
z= x(u, v), 


where the functions 4, #, x have sectionally continuous deri- 
vatives. It is found that every such surface can be enclosed in a 
region of arbitrarily small volume, consisting of a number of 
parallelepipeds. 

A consequence of this theorem is that if a plane region R 
bounded by a sectionally smooth curve is subdivided into two sub- 
regions R’, R’ which are separated by sectionally smooth arcs, 
the area of R is equal to the sum of the areas of R’ and R’. For 
we can subdivide the plane by straight lines parallel to the co- 
ordinate axes and so close together that all the rectangles which 
have points in common with the boundary of R or with the 
arcs separating R’ and R” have an arbitrarily small total area. 
As before, we define #, as the region consisting of all rectangles 
having points in common with R, and R, as the region consisting 
of all rectangles entirely within R; the regions R,’, R,', R,’, RB,’ 
are similarly defined. The regions R,’ and R,” together cover 
R,, some rectangles being counted twice; hence C(R,’) + C(R,”) 
= O(R,) = C(R). Again, R, and R, are contained in R,, and 
are completely separate; hence C(R) = C(R,) = C(R/’) + C(R,’). 
Since C(R,’) and C(R,’’) can be made to approximate as closely 
as we desire to C(R’) and C(R”) by making the subdivision fine 
enough, the first of these inequalities gives C(R’)-+ C(R”) = C(R); 
the second similarly gives C(R’)-+ C(R”) SC(R). Taken to- 
gether, these inequalities prove our statement. 

It is clear that this addition theorem still holds when the 
region & is subdivided into any finite number of regions R®, R®, 

.., R®, The extension to more than two dimensions follows the 
same lines and offers no difficulty at all. 


ie 


—— 
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3. The Existence of the Multiple Integral of a Continuous Function. 


Let the function f(z, y) be continuous in the interior and on 
the boundary of a region R. We wish to show that as the diameters 
of the sub-regions R, tend to zero the upper and lower sums 
im, AR,, UM, AR, (defined in Chap. IV, section 2, p. 224) tend 
to a common limit which is independent of the mode of sub- 
division. The proof is essentially the same as the corresponding 
proof in Vol. I, Chap. II, Appendix (p. 131), and can therefore 
be given quite briefly here. 

We first suppose that the subdivision of R into sub-regions R, 
is effected by polygonal paths. We choose the maximum diameter 
5 of the sub-regions R, so small that for every two points whose 
distance apart is less than 5 the values of the function differ by 
less than e. Then in each of these regions we have 


M,—m,<e. 


Thus for the difference between the upper sum and the lower sum 
we have 


=M, AR, — =m, AR, < XeAR, = <C(R). 


Every subdivision obtained by subdividing the given subdivision 
further obviously has a lower sum which is between the upper 
and lower sums of the original subdivision. 

The proof is complete once we show that for every two sub- 
divisions of R into sub-regions with diameters less than 8 the 
corresponding upper and lower sums of the two subdivisions 
differ from one another by as little as we please, provided only 
that 5 is chosen sufficiently small. 

If we are given a second subdivision into sub-regions R,’ 
which have diameters less than 8, then in this subdivision also 
the upper and lower sums will differ by less than «C(R): 


2M,’ AR,’ — Xm,’ AR,’ < «C(R). 


The two subdivisions together define a new subdivision which is a 
further subdivision of each of the two and which is obtained by 
collecting the common points of each pair of regions R, and R,! 
(if such points exist) into a region R,,””. By the previous remark, 
the lower sum of this third subdivision is not smaller than the 
lower sum of the two original subdivisions, and differs from 
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each of them by less than «C(R). Therefore the lower sums 
um, AR, and Xm,’AR,’ differ by less than 2«C(R). If we now 
let « tend to zero, it follows from Cauchy’s test that the lower 
sums have a limit independent of the mode of subdivision. 
- Since we have already seen that the upper sums differ from the 
lower sums by as little as we please, the upper sums have the 
same limit. This proves the existence of the double integral 


f f F(x, y)dS for polygonal subdivisions of R. 
R 


We made this assumption in order to be sure that a common 
subdivision into a finite number of regions R,,” really exists. 
If, for example, the boundaries of the sub-regions are curves, and 
a portion of a boundary curve in one subdivision consists of 
the line x = 0 and a portion of a boundary in the other consists 


of the curve 2” gia! == y, then the common subdivision will 
x 


have an infinite number of cells in the neighbourhood of z = 0. 
We can, however, easily get rid of this assumption of polygonal 
subdivision. For by p. 291 we can replace every curvilinear sub- 
division by a polygonal subdivision such that the total difference 
of the areas, and hence the difference of the corresponding lower 
sums, is arbitrarily small. This obviously reduces the case of 
sub-regions of arbitrary boundary to the special case already 
discussed. 

The proof is clearly independent of the number of dimensions. 

The corollaries on the existence of the double integral stated 
in Chap. IV, section 2 (p. 225) follow immediately from the 
approximation formula developed there and require no further 
proof here. 


2. GENERAL FoRMULA FOR THE AREA (OR VOLUME) OF A 
REGION BOUNDED By SEGMENTS oF SrraicHt LINES 
oR PLANE AREAS (GULDIN’s FormuLA). THE Potar 
PLANIMETER. 


The transformations on pp. 299-300 enable us to give a 
simple proof of the following theorems: 

If a straight-line segment S of constant or variable length | 
is in motion in a plane, and if ¢ represents time, then the area 
swept out by the moving segment is 


a 
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ty dn 
A= Ue) sae 


where t) and t correspond to the initial and final positions of 
the segment S, and dn/dt is the component of the velocity of the 
mean centre of S in the direction perpendicular to S. 
Again, the volume V swept out by a moving plane area P 
of area A is 
v= fan pa 


where dn/dt is the component velocity of the mean centre of the 
area A perpendicular to the plane of P. 

Both in these formule and in the proofs, we assume to begin 
with that the moving segment S or plane area A passes once and 


Fig. 16 


once only through each point of the region swept out (see fig. 16). 

We first give the proof in the case of a segment moving in a 
plane. The generating segment must be represented by an 
equation of the form 


a(t)e + By + yt)=0, a+ f=, (a) 
or else in the form obtained by solving this equation for the 
variable ¢: 

t= $2, y). 


We first carry out the transformation of 4 = =f f dx dy 
by means of the formula on p. 299 for the special case 


S (2, y) = 1, 


Denoting by ds the line element taken along the segment S, 
we obtain the expression 


: ds 
Bes) eae 
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for the area. It is easy to see, by substituting ¢ = (x, y) in 
formula (a) and differentiating with respect to « and y, that 


+(a’a + By + y’'). 


ame |= 
grad ¢ 


Hence the area is given by 
+A=fuf(a'ct Byt+y)d 
+ [ fie x By + y')ds 


Here a’, f’, y’ denote the derivatives of a, 8, y with respect to ¢. 
The integration with respect to s is to be taken along the seg- 
ment S. 

The single integral with respect to s is equal to 


WW (wX+ PY+y'), 


where (X, Y) are the co-ordinates of the mean centre of S. But 
X and Y satisfy the equation aX + BY + y= 0. On differen- 
tiating this equation with respect to t, we obtain 


a’X+ p¥ + y'+ aX’ + pY’=0. 
—(a’X + p'Y +’) = aX’ -+ BY’. 


Here a, B are the components of the unit vector perpendicular 

¢ to the segment S, and X’, Y’ are 
the velocity components of the 
mean centre at the time t. The 
expression a’X + f’Y + y’ is thus 
equal to the velocity of the mean 
centre perpendicular to S. This 
proves our formula. 

This result can be shown to be 
intuitively plausible by the follow- 
ing argument. We consider two 
neighbouring positions of the segment S, PQ and P’Q’, say (fig. 17). 
These two segments determine an area which is given approxi- 
mately by the product of the length PQ of S and the distance 
M'M of the mean centre of one segment from that of the other. 
The error in this approximation is of higher order than that of 
the increment of time &¢ corresponding to the displacement. It 


Thus 


Fig. 17 


eas 
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would be an instructive example for the reader to try to fill 
in the details of this geometrical argument and provide a strict 
proof. 

The corresponding theorem in three dimensions can be proved 
in the same way by the use of the transformation formule for 
volume integrals given on p. 300. There is no need to go through 
the proof here. 

In the special case of a plane region which is rotated about 
an axis while retaining its original size and shape, we have the 
problem already considered in Vol. I (Chap. V, p. 285), where 
Guldin’s rule for the volume of a solid of revolution was given. 

Our formule associate a definite sign with the area of the 
region swept out. In the two-dimensional case the sign depends 
on which of the two directions normal to S is regarded as posi- 
tive. (The same is true in three dimensions.) The area obtained 
is positive if the segment S, as it passes through any point, moves 
in the direction of the positive normal; otherwise it is negative. 

These observations allow us to extend our results to cases in 
which the segment or plane area does not always move in the 
same sense, or covers part of the 
plane (or space) more than once. 
The integrals given above will then 
express the algebraic sum of the 
areas (or volumes) of the parts of the 
region described, each taken with the 
appropriate sign. We leave it to the 
reader to work out how this may be 
taken account of in practice. 

As an example, let a segment 
of constant length move so as 
to have its end-points always on two fixed curves C and C’ in 
a plane, as in fig. 18. From the arrows showing the positive direc- 
tion of the normal we can determine the sign with which each 
area appears in the integral, and we find that the integral gives 
the difference between the areas enclosed by C and C’. If 
C’ contains zero area, as when it degenerates into a single seg- 
ment of a curve, multiply-described, the integral gives the area 
enclosed by C. 

This principle is used in the construction of the well-known 


polar planimeter (Amsler’s planimeter). This is a mechanical 
11° (8912) 


Fig. 18 
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apparatus for measuring plane areas. It consists of a rigid rod at 
the centre of which is a measuring-wheel which can roll on the 
drawing-paper. The plane of the wheel is perpendicular to the rod. 
When the instrument is to be used to measure the area enclosed by 
a curve C drawn on the paper, one end of the rod is moved round 
the curve, while the other is connected to a fixed point O, the pole, 
by means of a rigid member jointed to it. This end of the rod 
therefore describes (multiply) an arc of a circle: that is, a closed 
curve containing zero area. It follows that the normal motion 
of the mean centre of the rod gives the area enclosed by C, apart 
from the constant multiplier 1. But this normal component is 
proportional to the angle through which the measuring-wheel 
turns, provided that the circumference of the wheel moves on 
the paper as the rod moves, in which case the position of the 
wheel is only affected by the motion normal to the rod. 

In the instrument as usually constructed the wheel is not 
exactly at the centre of the rod, but this only alters the factor of 
proportionality in the result, and the factor can be determined 
directly by a calibration of the instrument. 


EXAMPLE 


Let S be a tube-surface (cf. p. 182) generated by a family of unit 
spheres whose centres lie on a closed curve C in the xy-plane. Prove that 
the volume enclosed by § is 7 times the length of C. 


3. VoLUMES AND AREAS IN Space OF ANY NUMBER 
oF DIMENSIONS 


l. Resolution of Multiple Integrals. 


If the region R of the zy-plane is covered by a family of curves 
$(x, y) = const. in such a way that through each point of & there 
passes one, and only one, curve of the family, we can take the 
quantity (x, y) = € as a new independent variable, that is, we 
can take the curves represented by (x, y) = const. as a family 
of parametric curves. 

For the second independent variable we can take the quantity 
y = y, provided that we restrict ourselves to a region 2 in which 
the curves 4(z, y) = const. and y = const. determine the points 
uniquely. 


ene da 
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If we introduce these new variables, a double integral 
f - J (2, y) dady is transformed as follows: 
Rk 


J [tle away =f [PSY agar, 


If we keep € constant and integrate the right-hand side with 
respect to 7, the integral with respect to 7 can be written in the 


form 
fpf p WP + $4 
V (be + $,*) be 


ds Vib2-+ $2) 
dn bz 


this integral may be regarded as an integral along the curve 
d(x, y) = &, the length of arc s being the variable of integration. 
Thus we obtain the resolution 


Jf fle, s)aedy = fa [Tee 


for our double integral. The intuitive meaning of this resolution 
is very easily recognized if we suppose that corresponding to the 
curves ¢(x, y) = const. there is a family of orthogonal curves 
which intersect each separate curve ¢ = const. at right angles, 
in the direction of the vector grad ¢. If the orthogonal curves are 
represented by the functions a(c) and y(c), where o is the length 
of arc on them, then 


Og Whi Y= aoe 
da V(b." + by) da V(be2 + $47) + $y?) 


Since 


Since 


dé, dx dy 
do be do + dy do’ 
we obtain 


= Vibe+ $8) = Vigad FE 


We now consider the quadrilateral mesh bounded by two curves 
f(z, y) = € o(a, y) = €+ AE, and two orthogonal curves which 
cut off a portion of length As from ¢(z, y) = €. The area of this 
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mesh is given approximately by the product AsAg, and this 
in turn is approximately equal to 


AsAé 
Vga? + $47) 
The transformation of the double integral, 


[[ fe. ndeay=f [40.9 Toe 


simply means this: instead of calculating the double integral by 
subdividing the region into small squares, we may use a subdivision 
determined by the curves (x, y) = const. and their orthogonal 
curves. 

A similar resolution can be effected in three-dimensional 
space. If the region R is covered by a family of surfaces d(2, y, 2) 
= const. in such a way that through every point there passes 
one, and only one, surface, then we can take the quantity 
£= ¢(2, y, 2) as a variable of integration. In this way we resolve 
a triple integral 


Sift y, 2) dady dz 


2 flay2) Vet bt + 6 
stew 


into an integration 


S(t, Y, 2 ‘ 
Saar irra 


over the surface ¢ = & and a subsequent integration with respect 
to & 


[fff ¥y; 2) da dy dz ft ea ($,7 fe oe + $7 


2. Areas of Surfaces and Integration over Surfaces in more 
than Three Dimensions. 


In n-dimensional space, that is, in the region of sets of values 


with ” co-ordinates, an (n — 1)-dimensional surface is defined by 
an equation 


(2, Va)... , Sp) = const. 


— 
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We suppose that a portion of this surface corresponds to a certain 
region B of the variables 2, 2, ... , Um, Where a, is to be 
calculated from the equation d(x, 2%, ... , Zn) = const. 

We now define the area of this portion of surface as the 
absolute value of the integral 


Aa ff. [Veet batt Pe) dasa... dtey 


In the first instance this definition is only a formal generalization 
of the formule for the area obtained by intuition in the case of 
three dimensions. Nevertheless, it has a certain justification in 
the fact that the quantity A is independent of the choice of 
the co-ordinate z,. This may be proved in the same way as for 
the three-dimensional case (cf. Chap. IV, section 6, p. 271). 

The integral of a function f(a, %,, ..., Zp) over this (n — 1)- 
dimensional surface we define as 


| f+ [flee 2 +145 %n)do 


=f f--[fe La, sy tq) Vibe Pe dy ey oe yy 


where, as before, we suppose that x, is expressed in terms of 
I, +++», Ly. by means of the equation ¢(a,..., %,) = const. 
We again find that the expression is independent of the choice 
of the variable z,. 

As in the case of two or three dimensions, a multiple integral 
over an n-dimensional region R, 


Lf Pl +s teddy +. dm 


can be resolved as on p. 300. We assume that the region R is 
covered by a family of surfaces 


(21, Ly, .. +» %_) = const. 


in such a way that through each point (2, ..., 2,) of R there 
passes one, and only one, surface. If instead of %,..., 2aa) Tn» 
we introduce 


By ve es Bngy C= Ply oo +s Bn)s 


as independent variables, the multiple integral becomes 
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I( peers n) Vv ale eee Pie) 
Jit fo fog $ - bon) dey... dtny 


es lrg TRE Wbo2-+ + $03)" 


3. Area and Volume of the 7z-Dimensional Unit Sphere. 


As an example we shall calculate the area and volume of 
the sphere in n-dimensional space, that is, the area of the 
(x — 1)-dimensional surface determined by the equation 


o?+...+a,2—= R 


and the volume interior to the (m— 1)-dimensional surface, 
which is the volume given by the inequality 


P+... 4+ 4,2 SR 
Let a continuous function f (r) of r= +/(#?+...+-«,7) be given 
inside the sphere. We shall first find the multiple integral 
f: : -f [Fe ae, ... da, over the sphere 72+ ...+ 0,2 R* 
We introduce the new variable 
72 = (ty, ... Bp) = TP... + By%, 
and in virtue of the relations 


V(b," + eae + bun?) = 2r, 
d(r?) = 2r dr 


we obtain the resolution 


ff LJilo)day doy =f fdr f... fdo= f Fey Qylonar, 


where Q,,(r) is the area of the sphere 72+ ...-+ 2? = 7. 
According to our general definition, the area of a hemisphere 


of radius r is given by the integral 
52x (r) = ae fas as est, 


where the integration is extended Sse the interior of the 
(mn — 1)-dimensional sphere 


get... + tyr. 
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If instead of the variables x, we introduce the quantities 
é="; Bee=l, 
r 1 


we obtain 


Oey Bet f., fF Bat = ry 


where we denote the area of the unit sphere ¢,?+4-...+ €,?=1 by 


= =2ff.. fa Et 


Then it follows that 


[fo [Pode ..- dy = On fy f(ryn-dr, 


We can now calculate w, conveniently from this formula; we 
extend the integration on the left: throughout the whole a,2,...2,- 
space (i.e. we let R increase beyond all bounds) and for f(r) 
we choose a function for which both the n-tuple integral on the 
left and the single integral on the right can be explicitly evaluated. 
Such a function is 


f(r) => eam attoatt.. ben) er. 


With this function the equation takes the form 


° e-#dar)' = wy [et m™tds, 
—« f) 


Since 
i ede = \/a 
(p. 262) and Bs 
ad 1lLf/n 
—r? nl ip — _ xis 
f er dr = sr) 
(p. 324), we obtain 
_ 2/7)" 
we Tp)” 


Here [ () means the elementary expression (C") if n 
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4 ioe iat 4/n if nis odd. For the general 


definition of the gamma function, see Vol. I, p. 250, and pp. 
323-5 of the present volume. In order to find the volume of the 
n-dimensional unit sphere we now put f(r) = 1 and obtain 


ta= fo. af [dey dy... dn = wy fete = 2; 


hence 


is even an 


War 
Tm + 2/2) 


Un 


4. Generalizations. Parametric Representations. 


In n-dimensional space we can consider an r-dimensional 
manifold for any r <n and seek to define its content. For this 
purpose a parametric representation is advantageous. Let the 
r-dimensional manifold be given by the equations 


ty = P(t + + +» Uy) 


. ° ° . . 


Gn = brl(Uy, «++ > Ur)s 


where the functions ¢, possess continuous derivatives in a region 
B of the variables (w,..., u,). As the variables w,, ... , wy, 
range over this region, the point (2%), ..., Z,) describes an r-dimen- 
sional surface. 

From the rectangular array 


Ou, Ou Ory 
On, Ont On 
Ug Oly OU 


CR OR a 
Ou, Ou, °° Ou, 


we now form all possible r-rowed determinants 


D, (-=1 seh (?)), 
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the first of which, for example, is the determinant 


0x, 0%, Oat, 
Gu, Ou," ” Guy 
Ox, Wl, OL, 
a 
do Ory Oa 
Ou, Ou,” Ou, 


The content of the r-dimensional surface is then given by the 
integral 


f.. [VDEF DP + Det diy... Mp. 


By means of the theorem on the transformation of multiple 
integrals (Chap. IV, section 4, p. 254), and simple calculations 
with determinants which we shall omit here, we can prove that 
this expression for the content remains unchanged if we replace 
the parameters u,,..., u, by other parameters. We likewise 
see that in the case r= 1 this reduces to the usual formula for 
the length of arc, and in the case r = 2 in a space of three dimen- 
sions it becomes the formula for the area. 

We shall give a proof for the case r = n — 1, where n is arbi- 
trary; i.e. we shall prove the following theorem: If f(a,...,%,)=0 
is an arbitrary (x — 1)-dimensional portion of surface in n- 
dimensional space, and if this portion can also be represented 
parametrically by the equations 


= Ut, oe > Un) C= 1, wey %), 


then its area is given by 
A=f.. [VDI+... + De -+ D,2du,... dup, 


where D, is a Jacobian of (n — 1) rows: 


D,= O(a 4, . - «> Leas Letyy + +» Gn) =f if O(Uy, -- » » Un) ; 
O(Uy, » 5 Uns) O(x4, 


ee Ve—-ys Vipyy ee ey Zn) 


Here, as always, we assume the existence and continuity of all 
the derivatives involved. 
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Without loss of generality we may assume that ¢,, + 0. 
Then, since by p. 301 A is given by 


A= fa. f BEd. deen 


we have only to show that 


1 
ae | grad |day... da,_4= VED i du, » +» Mega, 
In 
or 


| grad g [P= $.,%(ED2) Ce Pin EDA. 


O(a, «+» Vas) ni 
Now from the properties of Jacobians we have 


D, __ my, 5 Bay By +++» Tn) [ty - ~~, Sua) 


D, O(t4; Sry Un—1) O(u4, ess Un) 


O(hy, +5 Ln) 


This last Jacobian corresponds to the introduction of (a,..., 
Ly, Lytyy+++,%,) instead of (%,...,%_-3) a8 independent 


variables. But as the partial derivatives an are obtained from 
the equations u% 


TAL ee (@=1,...,n—1), 
Ox; 


we have —" = + —*, Hence 
s pin 
D/? = pu,” 
D,? Pin? 


which proves the formula for A. 
It may be mentioned here that the expression 2D, may be 


represented as a determinant of (n — 1) rows, ; 
2 
P Xu, My, Xue. Xu, Rug, 
ZDI= | Xu, %u|= ita Shes ba Gaver, «| ees 
i Renny Ma oes ose Keb, 


so that 
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A=f...[VG@du... dtins. 


Here the elements of the determinant are the inner products of 
the vectors %,, = ‘& On and Xy, = (= ee =") 
gs Ou; ” Ou, * OU, * Ou)” 
ie. the expressions 
50s Os 
1 Ou, Ou, 


EXAMPLES 


1. Calculate the volume of the n-dimensional ellipsoid 
ao a as 
a, + eee t ani . 


2. Express the integral J of a function of x,, depending on 2, alone, 


over the unit sphere z,7-+...-+ %,7= 1 in n-dimensional space, as a 
single integral. 


4, Improper INTEGRALS AS FUNCTIONS OF A PARAMETER 


1. Uniform Convergence. Continuous Dependence on the Para- 
meter. 


Improper integrals frequently appear as functions of a para- 
meter; thus e.g. the integral of the general power 


1 d 1 
eras 


in the interval —1 < x < 0 is an improper integral. 

We have seen that an integral over a finite interval is con- 
tinuous when regarded as a function of a parameter, provided 
that the integrand is continuous. In the case of an infinite 
interval, however, the situation is not so simple. Let us consider 


e.g. the integral 
F(z) = f a dy. 


According as x > 0 or x < 0, this is transformed by the substi- 
tution zy = z into 


00° 00 os 
sin z s1nz sin z 

f —— dz or f a= —f —- az 
z 0 2 0 2 


0 
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The integral i = dz converges, as we have seen in Vol. I (pp. 
fc) 


252, 418), and in fact it has the value 7/2 (Vol. I, p. 450, and 
p- 315 below). Thus in spite of the fact that the function 
(sinay)/y, regarded as a function of x and y, is continuous 
everywhere and its integral converges for every value of 2, 
the function F(x) is discontinuous; it is equal to 7/2 for positive 
values of x, to —2/2 for negative values of 2, and to zero for 
a= 0. 

In itself this fact is not at all surprising, for it is analogous 
to the situation which we have already met with in the case of 
infinite series (Vol. I, Chap. VIII, p. 394), and we must 
remember that the process of integration is a generalized sum- 
mation. In the case of an infinite series of continuous functions 
we required, if we were to be sure that the series represented a 
continuous function, that the convergence should be uniform. 
Here, in the case of convergent integrals depending on a para- 
meter, we shall again have to introduce the concept of uniform 
convergence, 

We say that the convergent integral 


F(a) =f fe, yay 


converges uniformly (in x) in the interval ax < Bf, provided 
that the “ remainder” of the integral can be made arbitrarily small, 
simultaneously for all values of x in the interval under consideration; 
more precisely: provided that for a given positive number ¢ 
there is a positive number A = A(e), which does not depend on a 
and is such that whenever B= A 


uc te, y)dy | <e. 


As a useful test we mention the fact that the integral 
f f(x, y)dy converges uniformly (and absolutely) if from a point 
Y = JY onward the relation 

[fe w1<= 


holds, where M is a positive constant and a> 1. For in this case 
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1 


dy 
— eee <a ees eee 
> uf on (a— ae =" @—)h4t 


| LF nay 


the right-hand side can be made as small as we please by choosing 
A sufficiently large, and it is independent of x. This is a straight- 
forward analogue of the test for the uniform convergence of 
series given in Vol. I, p. 392. 

We readily see that a uniformly convergent integral of a con- 
tinuous function is itself a continuous function. For if we choose 
a number 4 such that 


| {Fe way |<e 
for all values of x in the interval under consideration, we have 


Fle -+ h) — Fla) <| [ife+ho—fe way +26. 


In virtue of the continuity of the function f(z, y) we can choose 
h so small that the finite integral on the right is less than e, 
which proves the continuity of the integral. 

A similar result holds when the region of integration is finite, 
but the integrand has a point of infinite discontinuity. Suppose 
e.g. that the function f(a, y) tends to infinity as y-» a. We then 
say that the convergent integral 


F(a) =f fe, vay 


converges uniformly in aS xB if for every positive number « 
we can find a number k such that 


|<« 


provided hk, where k is independent of x. Uniform convergence 
in this sense occurs if in the neighbourhood of the point y = a the 
relation 

M 

2, y) |< ——~— 
| f(@, ¥) | Garay 
holds, where as before M is a positive constant and y <1. Just 
as above, we show that in the case of uniform convergence F(a) 
is a continuous function. 
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If the convergence is uniform, the improper integrals F(x) 
are continuous in a certain interval, say ina <2< B. We can 
then integrate them over this interval and thus form the corre- 
sponding improper repeated integral 


us ‘de i (@, y)dy 


or ‘ 
f ‘a i, F(a, y)dy. 


Instead of the finite interval a <a < B we can of course also 
consider an infinite interval of integration. 


2. Integration and Differentiation of Improper Integrals with 
respect to a Parameter. 


It is not true in general that improper integrals may be 
differentiated or integrated under the sign of integration with 
respect to a parameter. In other words, these operations are not 
interchangeable in order with the original integration (cf. the 
example on p. 316). 

In order to determine whether the order of integration in 
improper repeated integrals is reversible, we can often use the 
following test, or else make a special investigation on the lines 
of the following proof. 

Uf the improper integral 


Fo) =f fla, yay 
converges uniformly in the interval a Sx < Bf, then 
6 pw © AB 
faef Fe, y)dy =/ dy [ f(x, y) da. 
To prove this we put 
co) | 
i F(@, y)dy = f f(a, y)dy + R,(2). 
Then by hypothesis | R,(x) |< (A), where «(A) is a number 
depending only on 4 and not on a and tending to zero as A > 0, 


In virtue of the elementary theorem for ordinary integrals we 
have 
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[ef He, nay =f def Hee, Nay +f Rle\de 
= if Wy f fee i R,(2) de, 
whence by the mean value theorem of the integral calculus 
| aef Fe y)dy — fay ff y)de |< <(4)|B—a 


If we now let A tend to infinity, we obtain the formula stated 
above. 

If the integration with respect to a parameter also takes 
place over an infinite interval of integration, the change of order 
is not always possible, even though the convergence is uniform. 
It can, however, be performed if the corresponding improper 
double integral exists (cf. Chap. IV, section 5, p. 262 e& seq.). 
Thus e.g. 


. 


7 de i ‘fle, yay = f ‘dy iy Fle, y)de, 


if the double integral f f F(x, y)dady over the whole first quadrant 
exists. 

The proof of this follows from the fact that the improper double 
integral is independent of the mode of approximation to the region 
of integration. In one case we perform this approximation by 
means of infinite strips parallel to the z-axis, in the other by 
strips parallel to the y-axis. 

A similar result also holds if the interval of integration is 
finite, but the integrand is discontinuous along a finite number of 
straight lines y = const. or on a finite number of more general 
curves in the region of integration. The corresponding theorem 
is as follows: 

If when x lies in the interval a = x & B the function f(x, y) is 
discontinuous only along a finite number of straight lines y = a,, 
Y= 4,...,y = 4,, and of the integral 


Jf ‘fe, nay 


converges uniformly in x, then in this interval tt represents a con- 
tinuous function of x, and 
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feef Fe, y)dy =f yy f "fla, y) da 


That is, under these hypotheses the order of integration can be 
changed. The proof of the theorem is analogous to that given 
above. 


It is equally easy to extend the rules for differentiation with 
respect to a parameter. The following theorem holds: 

If the function {(x, y) has a sectionally continuous derivative 
with respect to x in the interval a <x S B and the two integrals 


P(e) =f fle y)dy and f fc, y)dy 
) 0 
converge uniformly, then 


P= "falar y) dy. 


That is, under these hypotheses the order of the processes of 
integration and of differentiation with respect to a parameter can 
be interchanged. For if we put 


Ga) =f fale, y)dy, 
0 
then, using the theorem of interchangeability just proved, we have 
é co) tea) é 
[ada =f da f’ fale, ydy =f dy f fale, yaa. 
The integrand on the right has the value 


€ 
J felt y)du= f(y) — fla y); 


therefore 
€ 
J Ga)da = Fé) — Fla); 
hence if we differentiate and then replace £ by x we obtain 


aF(x) Ory ie 
“ae = Ue) =f fale, dy, 


as was to be proved. 
We can similarly extend the rule for differentiation when one 
of the limits depends on the parameter z. For we can write 
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[Pfeny=f' fe nay+] fend, 
(x) (x) a 


where @ is any fixed value in the interval of integration, and we 
can then apply rules previously proved to each of the two terms 
on the right. 

As above, our rules of differentiation also hold for improper 
integrals with finite intervals of integration. 


3. Examples. 
1. As an example we consider the integral 


se 1 
i eV dy = -, (a > 0) 
0 xz 


If + = 1 this integral converges uniformly, since for positive values of A 
we have 


@a fe} 
foray sf evay= e4, 
A A 


where the right-hand side no longer depends on x and can be made as small 
as we please if we choose A sufficiently large. The same is true of the 
integrals of the partial derivatives of the function with respect to x. By 
repeated differentiation we thus obtain 


: iL 7", 2 aX nt 
[yma = 5 [vetay = seeee [vet = ser 


If, in particular, we put z= 1, we have 
ao 
In + 1) ={ yre Vdy = nl. 
0 


This formula has already been established in a different way in Vol. I, 
Chap. IV (p. 251). 
2. Further, let us consider the integral 
~~ dy nil 


0 Eg + y? a 2 x 
Again it is easy for us to convince ourselves that if x 2 a, where a is any 
positive number, all the assumptions required for differentiation under 
the integral sign are satisfied. By repeated differentiation we therefore 
obtain the sequence of formulz 


ie Oe SRE of See ee 

o @+y)? 2 2 a f (+ y7)8 2 2.4 oh °° 
1.3....(2n—3) 1 
2 


i] 


f ay 7”, 
o (2+ yy)" 2 


04. ... (Qn — 2) gerd 
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From these formule we can derive another proof 


(Crap. 


of Wallis’s product 


for (cf. Vol. I, Chap. IV, section 4, p. 224). For if we put 2 = Vn, we have 


? dy _ 1.38... (2n— 3) 
f + yy ~ 2°2.4... Qn — 3) V™ 


As n increases the left-hand side tends to the integral if ‘CV dy = h4/x. 
0 


For the difference 

00, A re) dy 

ev _— a 
feve—f aap 


satisfies the inequality 
T 
< [ 
) 


[ery =), : (i Su 


e~v 


=a as 


dy 


y?/n)n 


fe a) fo 0) 
7 dy 
+f evay re ny 


or, since (1 + y?/n)" > y, 


fer" atin 


ev" Peet enn 


~ (1+ y/n)" 


r 
<f 
0 


e 1 
dy +f evdy + i 
r 


o 
But if we choose 7’ so large that {| e-¥"dy-+ - < = and then choose n so 
T 


large that 2 


2 € 
dy <5 


T 
ie 
0 


1 
(+ xn 


as is possible in virtue of the uniform convergence of the process 


lim (1+ y/ny* =e, 
u—> 2 


it follows at once that 


This establishes the relation 


1.3...(2n—38 1 


which is equivalent to that obtained in Vol. I, p. 224. 


3... (2m— 38) 
eee Cat as 


i deere aa e | ve, 


> 


CO _« 
3. With a view to calculating the integral if od dy, we shall dis- 
0 


0 ; 
cuss the function F(x) = f ey ee dy. This integral converges uniformly 
0 


if a 2 0, while the integral 


nH 


f[ e-*Y sin y dy 
0 
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converges uniformly if = 8 > 0, where 8 is an arbitrarily small positive 
number. Both these statements will be proved below. Therefore F(z) is 
continuous if x = 0, and if x = 8 we have 


oO 
F(x) = =f e-*¥ sin y dy. 


We can easily evaluate this last integral by integrating by parts twice; 
we obtain 
1 


F(x) =e iq a 


From this we can find the value of F(x) by integration; this value is 
F(x) = are cotz + C, 


where C is a constant. In virtue of the relation 
° sin 

di) e-ay SRY dy 
0 y 


which holds if z = 8, we see that lim F(x) = 0. Since lim are cota = 0, 
xa x—>o 
C must also be 0, and we obtain 


ne | 


OQ 
<[ owdy=-— | =- 
ford = =| =3 


F(x) = arc cota, 


On account of the continuity of F(x) for x 2 0, 


lim F(x) = F(0) -( = dy, 


x—>0 


. . : wT 
which, since lim arc cot” == 


, gives the required formula 
x—>O0 2 


?siny T 
——4 dy = — 
i! y U8 


(cf. Vol. I, p. 450, footnote). 
We now return to the proof that 


© i, SINY 
ety = df 
f y ‘i 


converges uniformly if x = 0. If A is an arbitrary number and kz is the 
least multiple of x which exceeds A, we can write the “ remainder ” of 
the integral in the form 


od i Fer i 0 (v + 1) i 
f en ey siny dy = f ent siny dy+ & f erty siny dy. 
A y 4 y vak dun y 


The terms of the series on the mght have alternating signs and their absolute 
values tend monotonically to zero. By Leibnitz’s test (Vol. I, p. 370). 
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therefore, the series converges, and the absolute value of its sum is less 
than that of its first term. Hence we have the inequality 


© ny SiNY e+Dr | siny | (k+1)r ] Qn 
ey SOY <f enw LSI <f idy<= 
i) i] 2 4 y id 4 7 laa 


in which the right-hand side is independent of x and can be made as small 


as we please. This establishes the uniformity of the convergence. The 
uniform convergence of 


- 
few siny dy 
t) 


for z = 8 > 0 follows at once from the relation 


i 


On p. 310 we learned that uniform convergence of the integrals is a 
sufficient condition for interchangeability of the order of integration. Mere 
convergence is not sufficient, as the following example shows: 

If we put f(x, y) = (2 — xy)xye-”, then since 


00 e742 e748 


v4) 
e-*V siny dy Sf dy = a 
A = 8 


7) 
f(% y= ay (xye-™¥), 


.O 
the integral if (2, y)dy exists for every x in the interval 0 <2 <1, and 
0 


in fact for every such value of a it has the value 0. Therefore 


j ide if Flu ody = 0. 


On the other hand, since 
é 
f(a, y) = = (atye-™), 
for every y = 0 we have 


f Tes y)du = ye, 
0 


and therefore 


faf te y)dx =fyerdy = ford = 1 


ff te y)dy +fav [te y) da. 


Hence 
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4. Evaluation of Fresnel’s Integrals. 


The integrals 
+00 +o 
F,= is sin(r®)dr, F, = f. cos(7?)dr, 


which are of importance in optics, are known as Fresnel’s integrals. 
In order to evaluate them, we apply the substitution 7° = ¢, 
obtaining 


FRy= Se F, te oo it 
0 
Here we put 
1 


2 = oot 
woe 


(this follows from the substitution «= 7//t) and change the 
order of integration, as is permissible by our rules. Then 


R= J. feof e~*" sintdt, r= J facf e-*"t costdt. 


The inner integrals are easily evaluated by integration by parts, 
and F, and F, reduce to the elementary rational integrals 


2 7 1 2 77 w@ 
F,=—-| ——d&, fF, =— |] — 4 
Vato 1 at : Vale 


The integrals may be evaluated by the methods given in Vol. I 
(cf. Vol. I, p. 284); the second integral can be reduced to the first 


by means of the substitution 2’ = hs and both have the value 
x 


7 


——. That is, 


2/2 


EXAMPLES 
ao 
1. Evaluate f are—** das, 
0 
2. How must a, 6, ¢ be chosen in order that 


ea i} 
f Hi e—(axt+ 2bxy tev) dady = 12 
—-* 
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3. Evaluate 


+o +a 
(a) i f e~(ast+2bxy toy") And 4 2Bay + Cy2)dady, 
00 


—o 


+0 +00 
(b) i f e—(ax?+ Bboy + ey ag? 4+ Qbay +- cy?) dz dy. 
—-0 J—« 
(a > 0, ac — b8 > Q). 


4. Evaluate the following integrals: 


~o 
(a) K(a) =[ e— cogada. 
0 
© g~bx __ g—ax 
(b) ch oe noande, 
0 x 


(c) I(a)= f cet alett don 
0 


© gi J o(b 
(d) f eee) dz (where Jy denotes the Bessel function 
0 a defined in Ex. 4, p. 223). 


™ sin? aa 


n 
5*. Prove that f dz is of the order of logn when n is large, 


and that 


xz 


© gin?axz — sin? bx a 
f i dx = 4 log e 


fos) 
6. Replace the statement “the integral fA f(x. y)dy is not uniformly 
i) 


convergent” by an equivalent statement not involving any form of the 
words “‘ uniformly convergent”. (Cf. Vol. I, p. 45, Ex. 1.) 


5. THe Fourrer Inrecran 
L. Introduction. 


The theory given in section 2, p. 310 e¢ seg. is illustrated by 
the important example known as Fourier’s integral theorem. It 
will be remembered that Fourier series give a representation of 
a sectionally smooth but otherwise arbitrary periodic function in 
terms of trigonometric functions. Fourier’s integral gives a corte- 
sponding trigonometrical representation of a function f(x) which 
is defined in the whole interval —0 < %<-+ © and is not sub- 
ject to any condition of periodicity. 

We shall make the following assumptions about the function 


f(a): 
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(1) f(x) is sectionally smooth; that is, the function f(x) and 
its first derivative are continuous in any finite interval, except 
possibly for a finite number of jump discontinuities. 

(2) The integral 


[\fe|a—=c 


is convergent. 

(3) At a discontinuity x of the function it is assumed that 
f(x) is the arithmetic mean of the limits on the right and left. 
Thus 


f(x) = (fe + 0) + fle — 9). 


Fourier’s integral theorem may then be stated as follows: 
1 00 OO 
=_o]d t t— x)dt 
fla) == | dxf f(@) cosa(t — a)d, 
or, in complex notation, we have the equivalent formula 
eee Or at —in(t—») 
flo) = = f dxf fea. 
We may also state the theorem in the following form: if 


Lf” , 
71) = the—*" dé, 
He) = F=f fo 
then 
1 

V2 
The two formule last written are reciprocal equations for 
f(x) and g(x), each equation being the solution of the other. If 
the variable p= 7/2z is introduced and finally replaced by + 


again, we can express Fourier’s integral theorem by means of 
the two reciprocal formule 


io) =f “flbe-*a, f@= fo hte?" dt, 


f@= P"atne dr. 


where 
hr) = V 2a g(2z7). 


We shall give some examples to illustrate this theorem and 
then proceed to the proof. We first observe that if f(x) is an 
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even function—ie. if f(x) = f(—z2)—then a short calculation 
shows that the theorem may be stated in the simplified form 


7 2 oO oa} 
S(#) = . f cos (rx) dr i F (0) cos (rt) dt. 


Tf, on the other hand, f(z) is an odd function—ie. if 
J (*) = —f(—2x)—we obtain in the same way 


f(a) = Es f ‘sin (7a) dr [ F(t) sin (xt) de. 
EXxaMPLes 


1, Let f(z) = 1 when 2? < 1, f(z) = 0 when x? > 1. Then 


fle) = : i, ‘00a (cat) de f wend 


A 0, 2>1 
oe re 4, gta 
mo - 1, @<il. 


The integral on the right has played a part in mathematical literature 
under the name of Dirichlet’s discontinuous factor. 


2. Let f(x) = e-** (& > 0) when x > 0 and f(x) = f(—z). It is easy 
to show that 


2 pe « 2 f° k cos(t2x) 
ae kt = -- romnrai ° 
f(x= ak cos (ra) dxf cos (tt) dé Jp 2+ 8 dr 


But if we put f(—2x) = —f(x), we obtain 


2” % . 2 f° + sin(c2) 
=—f| si kt dt = — — - dt. 
F(x) au sin(ve)de f' sin (ét) dé al Boo T 
Hence we obtain the two integral formule 
® eos (v2) we en ke %~ gin (72) Tw 
= = dt = — e **, k> 0, 
0 poe 2k’ jy +78 ae oe ca 


3. The function f(a) = e—*"/2 gives an interesting illustration of the 
reciprocal formule. Since 


yA oO 
fz [ene cos ttdt = e—T/2 
ae) 


(see p. 318, Ex. 4a), the two reciprocal formule for g(t) and f(x) 
coincide. 
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2. Proof of Fourier’s Integral Theorem. 


The essential steps in the proof of Fourier’s integral formula 
are a transformation and a simple limit operation applied to 
Dirichlet’s limit formula 


afta) = tim [fo as, 

which holds for arbitrary positive values of a. We shall first 
prove this formula, although the substance of the proof has been 
given in Vol. I, Chap. IX, § 5 (p. 450). We rely on the elementary 
limit formula (cf. Vol. I, Chap. IX, p. 448) 


. B . ae 
Tim f ain (At) s(t)dt = 0, 


which holds when s(t) is continuous or sectionally continuous in 
the interval a S¢ SX f but is otherwise arbitrary. 
Let us first consider the interval from 0 to a. In this interval 


s(f) = —fersnfers 


1s a sectionally continuous function which, by the assumptions 
about f(x”), must have the limit f’(z -+ 0) as ¢ tends to zero. Thus 


Lae + yO) ae = [pee + 0) 22) a + faq) sim 4) a 


and the elementary formula given above shows that the last 
integral on the right tends to zero as A tends to infinity. 
The first integral on the right has the limit 


Jim fle+0)f BS do= fer of 82 do % fla+0) 


(cf. p. 315). If we now apply the corresponding argument to the 
integral from —a to 0, we obtain Dirichlet’s formula. 

The next step in the proof of Fourier’s theorem is the sub- 
stitution of the expression 


in(At) _ 
sa -/ 


cos(tr) dr 
) 


12 (2912) 
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in Dirichlet’s formula. We also introduce the notation 
a . a A 
[pe +e a= [fe + taf cos (tx)dr 
—@ t -a 0 
aN a 
=farf f(a@+ 8) cos(tz)dt = F(A, a). 
0 -—a 


Dirichlet’s formula then states that 
af (z) = lim F(A, a). 
A—>@ 


Since this limit is independent of a, we may write 
af (x) = lim lim F(A, a). 
a—>o A> 


If it were permissible to interchange the order of the limit 
operations in this formula, that is, if we might take the limit as 
a tends to infinity under the sign of integration, we should at 
once have 


A» ) Ce) 0 
af(z)= Yim f dr 8 fla-+#) cos tr) dé= is dr f. fle-+1) 008 tn) dt 


This immediately gives Fourier’s integral formula if we write 
a-+t=¢' and then replace ¢’ by ¢t. Thus the proof will be com- 
plete if we establish the change of order of limit operations 
km lim F(A, a) = lim lim F(A, a). 
amen A> @ »—> 2 aa 
Our previous work (p. 310; cf. also p. 104) shows that it is 
sufficient to prove that the limit 
lim F(A, a)=f f(e+ 1) Sin) a 
a—>o —-2 t 
exists uniformly with respect to A. 
To prove this, we must show that if ¢ is given in advance 
we can find A independent of A, such that | F(A, a)— F(A, 6) | <e 
whenever @ and b both exceed A. But 


[Fa — FO Sf feta [LO a 


oe | sin (Az) | 2° 
+f e+ ole des 7) Ife) la 
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It follows at once that 
| F(A, a) — F(A, 8) | <2, 


so that it is only necessary to take A = 2C/e. This gives the proof 
of uniform convergence, and completes the proof of Fourier’s 
integral theorem. 


6. Taz Evierran INTEGRALS (GaMMa Function) * 


One of the most important examples of a function defined 
by an improper integral involving a parameter is the gamma 
function I'(z). Here we shall give a fairly detailed discussion 
of this function. 


1. Definition and Functional Equation. 


The function (a) is defined for every « > 0 by the improper 
integral 


re) =/ ete ld, 


In Vol. I, Chap. IV, pp. 250-1, we studied this integral for integral 
arguments «=m. The method used there shows at once that 
the integral converges for any «> 0, the convergence being 
uniform in every closed interval of the positive z-axis which does 
not include the pomt z= 0. The function I(x) is therefore con- 
tinuous for x > 0. 

By simple substitutions we can transform the integral for 
['(z) into other forms which are often used. Here we only 
mention the substitution ¢= u?, which transforms the gamma 
function into the form 


Tio) = 2f eww du, 
(x) f eu U, 
Thus the frequently-occurring integral 


oO 
f eu du (a >—1) 
0 


* A discussion closely related to the present one is given by E. Artin, 
Einfihrung in die Theorte der T-Funktion (Leipzig, 1931). 
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can be expressed in terms of the gamma, function as 


[evan ain ¢ a *) 
0 2 2 


(cf. section 3, p. 303). 
Integration by parts shows, as in Vol. I, p. 251, that the 
relation 


T(x + 1) = #I(2) 


holds for any e>0. This equation is called the functional 
equation of the gamma function. 

Of course I(x) is not uniquely defined by the property of 
being a solution of this functional equation. In fact, we obtain 
another solution merely by multiplying I(x) by an arbitrary 
periodic function p(x) with period unity. On the other hand, 
the functions 


u(x) = T(x)p(z), = p(w + 1) = (2) 


represent the aggregate of all solutions of the equation; for if 
u(x) is any solution, the quotient 
f(@)= 


u(x) 
Toy 


which can always be formed since I(x) + 0, satisfies the equation 
f@+)=f(2). 


Instead of the function I'(z), it is frequently more convenient 
to consider the function u(x) = logI'(x); since I(x) > 0 for 2 > 0, 
this is always defined. The function satisfies the functional 
equation (difference equation) 


u(a@ + 1) — u(x) = loge. 


We obtain other solutions of this equation by adding to 
logI'(a) an arbitrary periodic function with period unity. In 
order to specify the function log I(x) uniquely, therefore, we must 
supplement the functional equation by other conditions. One 
very simple condition of this type is given by the following 
theorem, due to H. Bohr: 

Every convex solution of the difference equation 


ula + 1) — u(x) = loge 
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is identical with the function logT'(x) in the interval O< x<&, 
except perhaps for an additive constant. 

2. Convex Functions: Proof of Bohr’s Theorem. 


We say that a function f(x) is convex in a region aSaSb 
if for every two points x, and 2, of the region and every two 
positive numbers a, 8, where a-+ B = 1, the expression 


af (a) + Bf(%) — flax, + Bate) 


never changes sign; or, intuitively speaking, if the chord joining 
two points of the curve y= (f(z) either never lies beneath or 


y 


f(a x, + B%X2) 


Ul 
Ly LX, +BX2 Hg 


Fig. 19.—A function which is convex downwards 


never lies above the arc of the curve itself between 7, and 2, 
(cf. fig. 19). (Cf. also Chap. I, section 1, p. 8 and Chap. II, p. 100.) 

Before proving this theorem we shall establish certain pro- 
perties of convex functions. We restrict ourselves to functions 
which are “convex downwards ”, for which 


af (%) + Bf (a2) — f(ax, + Bx) 20 


39 


holds; functions which are “ convex upwards” can always be 
changed into functions which are “convex downwards” by 
multiplying by —1. 

If a convex function f(x) is twice continuously differentiable, 
the expression 


af (x) + Bf(%2) — flax + Baa) 
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can be represented by the double integral 
1 at 
le, — af def fay + (a — mr} dr, 


as is easily verified. Thus the inequality of the definition is 
certainly satisfied, provided that 


f(a) 2 9. 


On the other hand, the passage to the limit 7, > 2, shows that 
this condition is also necessary, and it is therefore a characteristic 
property of convex functions which are twice continuously dif- 
ferentiable. 

A fact which is noteworthy and useful in applications is that 
we need not assume the continuity of a convex function f(z); 
on the contrary, this property follows from the definition of 
convexity. We can in fact replace the above inequality by an 
apparently weaker one, which, however, is equivalent to it, 
expressed in the following theorem: 

[f for every x and h for which the arguments x +h still lie in 
the region of definition the bounded function f(x) satisfies the inequality 


Seth) +fe—h)— 2f(@) 20, 


that is, if the mid-point of every chord of the curve y = f(x) lies 
above or on the curve, then f(x) 1s convew. 
We first show that every bounded function f(x) which satisfies 
the inequality 
fl@+h)+fle—h)— 2f(@) 20 
ts continuous. 
To prove this we write the condition in the form 


f(x) —f(@—h) Sf(x+ h) — f(a), 
from which we derive the inequalities 


f(a — vh) — fle — (v + Ih) S f(x + h) — f(a) 
Sfet (vy + 1h) — fle + vh), 


valid for every integer v0. If we add these for values of v 
from v= 0 to v= n— 1, we obtain the estimate 
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and hence if we assume that | f(x) | SC, 


[fle+b) —fte)| <2. 


Here ” can be any positive integer such that the argument 
x + wh lies in the interval of definition. If we let h tend to zero, 
the largest possible number 7 increases without limit, that is, 
the expression f(x + h) — f(x) tends to zero. This proves the 
continuity of f(z). 

From the continuity of f(x) we can now easily prove its con- 
vexity, that is, we can establish the inequality 


af (2%) + Bf (at) — f (ax + Bx) = 0. 
From the inequality 


Sf (2) — f(a — nh) S n{f(@ + 2) —f(@)}, 
by means of the substitution 
€=a— nh, 
we obtain the relation 
SE + nh) —f() —f(E+ (m+ A) — f(8) 
n = n+1 , 
and hence in general 


AE + mh) — FE) < AE + mh) — FS) 0<msn 
m a ~ 


If we put + nh = &,, a few transformations give 
(1-20 + 2G 2F((1— e+ 24), 


which is exactly the inequality we require for rational values of a 
and f. We then deduce its validity for any values of a and B 
from the continuity * of f(z). 


* From the inequalities 
f(z + ¢) ~ f(®) < fe + a) ~ fz) < fa + 6) — fe) 
c re a nan 6 


whose validity for any numbers c < a S& b differing from zero is a direct con- 
sequence of the definition of convexity, we see that the difference quotient 
feroon) is bounded and monotonic if @ tends to zero through positive 
values or through negative values, and it therefore possesses a limit. Thus a 
convex function has a derivative on the right and on the left at every point. 
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Finally, we apply the following inequality, which is obvious 
from the geometrical interpretation of a convex function: 


f@+ h)+fe—h)— {(f@+ 8) +f@—s} 20. 


Here 5 and h are two positive numbers, 5 < h. 
This is proved by adding the two relations 


(1+ 3) te— +5 (1—Z) fe + —Fle— 8) 20, 


s(1—j)fe— +3 (14 F)Met i) — fle-+ 8)=0. 


We now return to the theorem of Bohr stated above (p. 324-5). 
We see at once that logI'(x) is convex. For if we write I(x) 
in the form 


00 
T(a) = if em HR e—1+ AZ g—t124e—1— W/E dy 
0 


where / has any positive value and « any value greater than h, 
and apply Schwarz’s inequality (cf. Vol. I, Chap. IX, p. 451), 
we have 

{Ta} ST + He hy, 
and therefore * 


logI'(z + h) + log l(a — h) — 2 logI(z) = 0. 


Again, if f(x) and g(x) are two continuous convex solutions of 
the functional equation 


u(x + 1) — u(x) = loge, 


* This fact is a special case of a general theorem. If the functions f,(zx), 
v= 1,2,..., satisfy the conditions 


f(z) ZO and {f,(x)}PS fle — h)f,(x + h), 


so that the functions logf,(z) are convex, then the sum z f(z) also satisfies 
these conditions. 
For if we write X f,(x) in the form 


FAC) 
Zhe) = LE Fer h) Vie +h), 


and use the relation 
fA) <1 
Vi (a —h) Via +h)” 
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the difference 
$(x) = f(x) — g(x) 


is a continuous periodic function of period unity. Moreover, since 
f(@+ 1) — f(a) = loge 
f(e) — f(a — 1) = log(z — 0), 
J (x) satisfies the relation 


and 


fle+ 1) +fe— 1) — 2f (a) = log = 
Since f(x) is convex, the inequality 
fle+ by + fle h) — 2f(o) Slog ——, 


holds for every / in the range 0 << h <1 (cf. p. 328). 
We likewise obtain 


Hu + h) + o(e — h) — 29(x) S log 
and therefore 


| f(a + h) + $a — h) — 24(x) | S 2 log 


x 
2 
a—l 


z 
r— 


If we now let x increase beyond all bounds, the expression 


log ; wl i tends to zero, and so does the function 
$(a + h) + $(a — h) — 24(@). 
we obtain the inequality 


{Pio S Ev fe= MH VIE e HY 
if we now apply Schwarz’s inequality to the right-hand side, we have 
ELOY SEhl@ - EL le + bi 
An analogous theorem holds for integrals of the form 
[teas 


if for all values of the parameter ¢ the functions f(z, t) satisfy the conditions 
f(z, ) 20 and {f(z,)} Sfx — h, hfe + ht). 


The gamma function is of this type. 
12¢ (Bg 912) 
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Since this function is periodic, we obtain the equation 
$e + h) + o(w — h) — 24(a) = 0, 


valid for every x > 0. 

A continuous periodic function ¢(x) which satisfies such a 
condition for every positive value of h and every value of x greater 
than h# must be a constant.* This, however, proves that any 
continuous convex solution of the equation u(x+ 1)— u(x) = loga 
can differ from logI'(x) only by an additive constant. 


3. The Infinite Product for the Gamma Function. 


In this sub-section we shall give the infinite products for the 
gamma function found by Gauss and Weierstrass. 
We first show that the relation 


T(x) = lm G,(@) 


holds, where 
1.2...(n—1) 
wz-+ 1)...(~7-+ 2 — 1) 


This statement is plausible, since for integers c = v we have 


G(r) = (v — 1)" 


n® 


G(x) = 


n n 
an+lon+tv—V 


and as n increases this obviously tends to the value (vy — 1)!. 

We must show in general that the sequence G,(x) converges 
for every x = 0, —1, —2,..., and then that the limit function 
G(x) coincides with the gamma function for positive values of 
x. To prove this last statement we notice that if «> 0 the 
function logG(«) satisfies the functional equation 


u(e + 1) ~ u(x) = logs. 


By Bohr’s theorem we have only to show that logG(z) is 
convex. 


*TIf, say, (1) = ¢(2) = a, then by the equation $(#) = 4(4(1) + 4( 2) 
we have $($) = a, and likewise ¢(z,) = @ at all points 2, of the interval 1S 
x <2 which are obtained by repeated bisection of the sub-intervals. Since 
these points x, are everywhere dense, from the continuity of (x) it follows 
that ¢(z) = a throughout the interval 1 Sa <2, and by the periodicity of 
é(z) this holds for every x > 0 
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In order to prove the convergence of the sequence G,,(x) for 
«= 0, —1, —2, ... we introduce the expression 


Piro: v(vs 3) (1+3)-- +53) 


for the number x, and accordingly write 
G,(2) = 1"F nf + (1 + 1/v)* 
. £ , 1+ T+ a/v v 
By a test proved in Vol. I, p. 421, the product 
7 + Ly) 
1 1l+2/v 
converges absolutely and uniformly provided that the series 
S/Q+iye_, 
1] l+<a/v 


converges uniformly. If we use the Taylor expansion in powers 
of 1/v up to the terms of second order, the general term of this 
series can be written in the form 


(l+ 1/yy?— (+ 2/v)_ 1 oe—1) (1+ O/v)"? 
1+ a/v 2y? 1+ 2/v ‘ 


where @ is a number between 0 and 1. From this it follows that 


(1+ Ip — (1+ 2/»)| © 


14+ a/v ye 
where C is a constant independent of v. In every closed region 
which contains none of the points x = —1, —2, ... we can re- 


place the estimate C by a number which is also independent of x. 
In every such region the series converges uniformly, and therefore 
the product does so too. 


The limit function 
Gte\ = “fie OS Ns 
see Ey (7+ n"— 1) 
is continuous for every z += 0, —1, —2, ... and, as we see at 


once, satisfies the functional equation 


G(x + 1) = xG(z). 
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Tn order to show that if x > 0 the function G(x) is identical 
with the function I'(z), we consider the function log G(x) for x > 1. 
It is the limit function of the sequence 


n—-1 
log G,,(x) = log(n — 1)! + zlogn — & log(a + »). 
v=0 


For any positive value of h and any value of z greater 
than h the functions logG,(x) satisfy the pondiieon for con- 
vexity, 


logG, (x + h) + logG,(« — h) — 2 logG,(x) 
="S @log(a+ ») —log(a+ b+ v) — log(z— h-+ v)) 20, 
v=0 


which consequently applies to the function logG(a) also. Since 
in addition 
log G(1) = 0 = logI'(1), 


by the general theorem G(x) must be identical with I(x). We 
have therefore obtained Gauss’s infinite product for I(x): 


1.2...(8—1) as 
rae. (e+n—1)" 


_ i (i+ 1/>)* 1/v)* 
eae “T+ a/v" 


The theoretical importance of this expression arises from the 
fact that we can regard it as defining the gamma function, not 
only for all positive values of x, but also for all negative non- 
integral values of 2. 

This product can easily be put in a somewhat different form. 
If in the expression 


Ta) = 


n* = e* logn 


we substitute for logn the value 
1 1 
logn=1+=+4+...+-— 
ogn +5 Peet a, + Ens 


where y is Euler’s constant (cf. Vol. I, p. 381) and e, tends to 


zero as n -> ©, we obtain an expression for 


1 
T(z) 
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1 ‘ x x x copiers 
_~_=:glim = ee at. HRB cee ey Tye ent 
i@ x lim (1+) (144) (1+ ae :) e~*-2 
x n—l £ x” 
== ge lime~**~n TT ( + “Ne 
1 Vv 


Since the factor e~**~a tends to 1 as n increases, the product 
# 


Il ( + 3) e-» also converges and gives Weierstrass’s infinite 
1 Vv 


product for Tw’ 


from which we see at once that Fa has zeros of the first order at 
(as 


the points z= 0, —1, —2,.... 


4, The Function logI'(x) and its Derivatives. 
If we form the logarithm of Weierstrass’s infinite product 
1 3 ie ry x xy 
ta = i( + “e : 
we obtain an expression for the function logI'(x): 
log I'(x) = —logr ~— yx — & (lee (a + \ _ *\. 
yer] v 


v 


By the relation 


v+t 
whence 
x x 1 r* Fra 
tog (14-5) ~ 3] st oe 


the right-hand side of the equation for logI'(x) is dominated by 
2 0 
the series 52 4 and therefore converges absolutely and uni- 


formly in every closed interval of the positive a-axis. 
The derivatives of the function logI(z) are of particular 
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interest, since they provide an explicit representation of the 
0 m 

values of the series ( J ) ; 
o\e+y 


If we differentiate the expression for logI'(x) term by term 
with respect to 2, we again obtain a series which, since 


1 1 x 


ay yp v(a + vy’ 
converges absolutely and uniformly in every closed interval of 
the positive z-axis. Hence, by known theorems on the differentia- 
tion of infinite pike 


ue) @ og l(a) = =— *- y- /( d -1), 


T(z) ) da atv »v 
If we again differentiate term hs term, we ow obtain 
& 
a! T(z) = 
and finally, forming the higher derivatives, 
oe al (=1)"_ dm 


. ox i (m— 1)! dam logI(x) = (m 2 2). 


5. The Extension Theorem. 

The values of the gamma function for negative values of x 
can easily be obtained from the values for positive values of 
x by means of the so-called extension theorem. If we form the 
product ['(x)I'(—x), which is 


2...(n—1) atlin 1.2...(n—1) ay 
n->ot(@-+1)...(2+n—1)  n+0©—2(1—x) (2—z)...(n~—1—z) 


and combine the two limiting processes into one, we obtain 


T(x) P(—2) =~ 


2 ee 1 
1 ; 
#2 ,->0{1—(@/1)} {1— @)2)... {I @/n—1))} 


But by the infinite product for the sine, 


ssa = ii (1- (%)) 
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deduced in Vol. I, p. 445, we have 
T(x) T(—2) = — 


Tv 
x sin 7x 


Hence 
a 
a sina T(z) 


I(—2z) = — 


We can put this relation in a somewhat different form by 
calculating the product ['(x)T'(1 — x). Since 


Tl — 2) = —aT(—2), 


T(z) T(1 — a) = —aI(x)T(—2), and we obtain the extension 
theorem 


(a2) Td — 2) = 


sin 7x 
Thus if we put =i, we have I(3)=+/m. Since 
T(4) = 2 f edu, here is a new proof for the fact that the 
0 


integral np edu has the value 44/7. In addition, we can 
0 


calculate the gamma function for the arguments z= n+ 3, 
where n is any positive integer: 


Co C-DE-9-0 


6. The Beta Function. 


Another important function defined by an improper integral 
involving a parameter is Euler’s beta function. The beta 
function is defined by 


B(a, y) = f eM — t)y-1dt. 
0 


If either x or y is less than unity, the integral is improper. By 
the criterion of section 4, p. 307, however, it converges uniformly 
in @ and y, provided we restrict ourselves to intervals v = e, 
¥ = », where ¢ and 7 are arbitrary positive numbers. It therefore 
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represents a continuous function for all positive values of x and y. 
We obtain a somewhat different expression by using the 
substitution ¢ = 7 -} 3: 


Bis, y) = ‘a G + SG — 1)"as, 


or, in general, if we now put 7 = ¢/2s, where s > 0, 
(28)e+7-1B(x, y) = f (6 + 1) — ytd. 
If, finally, we put ¢ = sin*¢ in the original formula, we obtain 


{2 
Bo, y) = 2 i sin?*-14 cos*’-1¢ dd. 


We shall now show how the beta function can be expressed 
in terms of the gamma function, by using a few transformations 
which may seem strange at first sight. 

If we multiply both sides of the equation 


(28)"*" "Biz, y) =/ (s + t)?-(s — t)y-1dt 
by e-** and integrate with respect to s from 0 to A, we have 
a3 Ss 
ne nfemepetinn [etal e+ ort aria 


The double integral on the right may be regarded as an 
integral of the function e-%(s -+ ¢)*-1(s — ¢)*-1, the region of 
integration being the isosceles triangle bounded by the lines 


s+t=0 and s=A. 
If we apply the transformation 


o=s+t, 
r=s—1, 


this integral becomes 
1 —T—~7 po—1y—l 
5 J e-*-To%-170-1 do dr. 


As the region of integration we now have the triangle in the 
o7-plane bounded by the lines o = 0, r= 0, and o + 7= 24. 
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If we now let A increase beyond all bounds, the left-hand side 
tends to the function 


= Bia, y) (a + 9). 


The right-hand side must therefore converge also, and its limit 
is the double integral over the whole first quadrant of the o7-plane, 
the quadrant being approximated to by means of isosceles 
triangles. Since the integrand is positive in this region and the 
integral converges for a monotonic sequence of regions, by 
Chap. IV (p. 263) this limit is independent of the mode of 
approximation to the quadrant. 

In particular, we can use squares of side A, and accordingly 
write 

4 A 
Bia, ye + y) = lm f vf e-*—*9*-1 79-1 dadr 
4—>o “0 


0 


te) 20 
— i eo do f er dr, 
0 fy 


We therefore obtain the important relation * 


_ TP) 
a Tey). 


From this relation we see that the beta function is related to 


the binomial cooficients ("+ "\— erat my in roughly the same 


* This equation can also be obtained from Bohr’s theorem. We first show 
that B(x, y) satisfies the functional equation 


Be + 1,94) = —— aoe Ys 
so that the function 
u(x, y) = (a + y) Bir, y), 
considered as a function of x, satisfies the functional equation of the gamma 
function, 
ula + 1) = wuz). 


Since by the theorem in the footnote on p. 328 it follows that logu(z, y) is 
a convex function of z, we have 


Tie + y) Bia, y) = T(x). aly), 
and finally, if we put x = 1, a(y) = I'(y). 
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way as the gamma function is related to the numbers m!. For 
integers =n, y= ™m, in fact, the function 
1 
@+y+)Bet+hLy+) 


has the value (" Ey ™). 
n 


Finally, we mention that the definite integrals 


7/2 {2 
f sinttdt and f cost dt, 
0 0 
which are identical with the functions 
es ac), 
2 2 ° 2 2 2 2 


can be simply expressed in terms of the gamma function: 


i! ony 7 T((1 + a)/2) 
ff sinsedt = f cost dt = - Ta) : 


EXAMPLES 


1. Prove that the volume of the positive octant bounded by the planes 
z=0, y= 0,z2=h, and the surface 


2. Prove that 


SffrG + v+ kein 


taken throughout the positive octant of the ellipsoid = aa 2+4 +5 el 


= (5) r() AO "®) ryog tatr_4 


S r(etety) “ 


dé. 
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(Hint: Introduce new variables & », ¢ by writing 


Oye tog er ear 
¥ f= by or oy = bV Ex(1 — %) 
== Ent z= ov/Ent, 


and perform the integrations with respect to » and ©.) 
3. Find the z-co-ordinate of the centre of mass of the solid 


1 1 1 
(2) + ()"+ Gs 220,y20,220. 
a, 6. ¢, 


4, Find the moment of inertia of the area enclosed by the astroid 
at + y! = Ri 
with respect to the x-axis. 
5. Prove that 
T(z) T(@ + 4) 
220 NE = Vr 
T(2z) . 


7. DIFFERENTIATION AND INTEGRATION TO FRACTIONAL 
OrpER. ABEL’s INTEGRAL EQUATION 


Using our knowledge of the gamma function, we shall now 
carry out a simple process of generalization of the concepts of 
differentiation and integration. We have already seen (p. 221) 
that the formula 


= (2 — t)” 1 Made © aha 
Fe) = [TF fode= 5 [e— om sind 
gives the een integral of the function f(x) between 


the limits 0 and z. If D symbolically denotes the operator in 


differentiation and if D~* denotes the operator I. * dx, which is 
the inverse of differentiation, we may write 


F(x) = D-" f(z). 
The mathematical statement conveyed by this formula is that 


the function F(x) and its first (n— 1) derivatives vanish at 
x= 0 and the n-th derivative of F(x) is f(z). But it is now 
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very natural to construct a definition for the operator D~* even 
when the positive number A is not necessarily an integer. The 
integral of order X of the function {(x) between the limits 0 and x 
as defined by the expression 


Df e) = a [ee — Pf de 


Ta) 
This definition may now be used to generalize nth-order 
differentiation, symbolized by the operator D* or - to wth-order 


differentiation, where » is an arbitrary non-negative number. 
Let m be the least integer greater than p, so that w= m— p, 
where 0< p <1. Then our definition is 


D* fle) = D®D-*f(a) = Fs [@— 9 *f at 


A reversal of the order of the two processes would give the 
definition 


l Oe 
D* f(x) = D-*D™ f(a) = — | (w4— t-1f™(é) dt. 

fle) fle) = ay fe PFO) 

It may be left as an exercise for the reader to employ the 
formule for the gamma function to prove that 


D*D'f (2) = D’D*f(a), 


where a and f are arbitrary real numbers. He should show that 
these relations and the generalized process of differentiation have 
a meaning whenever the function f(x) is differentiable in the 
ordinary way to a sufficiently high order. In general D*f(x) 
exists if f(z) has continuous derivatives up to and including the 
mth order. 

In connexion with these ideas we may mention Abel’s integral 
equation, which has important applications. Since I'($) = Wz, 
the integral of a function f\ : to the order } is given by the formula 


Dose) =<. [ FO. = we) 


If we assume that the function (x) on the right-hand side 
is given and that it is required to find f(x), then the above formula 
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is Abel’s integral equation. If the function (x) is continuously 
differentiable and vanishes at x = 0, the solution of the equation 
is given by the formula 


f(x) = Diyi(a), 


_ id f* 
fea), a 


or 


8. Nore on tHE DEFINITION OF THE AREA OF A CURVED 
SurFACE 


In section 6 of Chap. IV (p. 269) we defined the area of a 
curved surface in a way somewhat dissimilar to that in which 
we defined the length of arc in Vol. I, Chap. V (p. 277). In the 
definition of length we started with inscribed polygons, while in 
the definition of area we used tangent planes instead of inscribed 
polyhedra. 

In order to see why we cannot use inscribed polyhedra, we 
may consider a cylindrical surface in xvyz-space with the equa- 
tion a?-+ y? = 1, lying between the planes z= 0 and z= 1. 
The area of this cylindrical surface is 27. In it we now 
inscribe a polyhedral surface, all of whose faces are identical 
triangles, as follows. We first subdivide the circumference of the 
unit circle into » equal parts, and on the cylinder we consider 
the m equidistant horizontal circles z= 0, z= h, z= 2h,..., 
z= (m— 1)h, where h=1/m. We perform the subdivision of 
each of these circles into » equal parts in such a way that the 
points of division of each circle lie above the centres of the arcs 
of the preceding circle. We now consider a polyhedron inscribed 
in the cylinder whose edges consist of the chords of the circles and 
of the lines joining neighbouring points of division of neighbouring 
circles. The faces of this polyhedron are congruent isosceles tri- 
angles, and if n and m are chosen sufficiently large this polyhedron 
will lie as close as we please to the cylindrical surface. If we now 
keep n fixed, we can choose m so large that each of the triangles 
is as nearly parallel as we please to the ry-plane and therefore 
makes an arbitrarily steep angle with the surface of the cylinder. 
Then we can no longer expect that the sum of the areas of the 
triangles will be an approximation to the area of the cylinder. 
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In fact, for the bases of the individual triangles we have the 
value 2sinz/n, and for the altitude, by Pythagoras’ theorem, 
we have 


1 a\2 ne _4% 
Fins 1—cos— ) =4/—.+ 4sint —. 
ne + ( i *) m aa Qn 


Since the number of triangles is obviously 2mn, the surface 
area of the polyhedron is 


Pa m= 2mnsin= ne + 4sint” = 2n sin” afl + 4mm? sin’. 
: n Vm 2n n 2n 


The limit of this expression is not independent of the way in 
which m and n tend to infinity. If, for example, we keep n fixed 
and let m — ©, the expression increases beyond all bounds. If, 
however, we make m and n tend to © together, putting m = n, 
the expression tends to 27. If we put m= n?, we obtain the 
limit 27/1 + 74/4, and soon. From the above expression Fy, m 
for the area of the polyhedron we see that the lower limit (lower 
point of accumulation; cf. Vol. I, p. 62) of the set of numbers 
Frm is 27; this follows at once from Fy _ 2 2nsinz/n and 

lim 2n sin z/n = 27. 
na—>@ 
In conclusion we mention—without proof—a theoretically 


interesting fact of which the example just given is a particular 
instance. If we have any arbitrary sequence of polyhedra tending 
to a given surface, we have seen that the areas of the polyhedra 
need not tend to the area of the surface. But the limit of the areas 
of the polyhedra (if it exists), or, more generally, any point of 
accumulation of the values of these areas, is always greater than, 
or at least equal to, the area of the curved surface. If for every 
sequence of such polyhedral surfaces we find the lower limit 
of the area, these numbers form a definite set of numbers associated 
with the curved surface. The area of the surface can be defined 
as the lower limit (lower point of accumulation) of this set of 
numbers.* 


*This remarkable property of the area is called semi-continuity, or more 
precisely lower semi-continuity. 


CHAPTER V 


Integration over Regions in Several 
Dimensions 


The multiple integrals discussed in the previous chapter are 
not the only possible extension of the idea of integral to the case 
of more than one independent variable. On the contrary, there 
are other generalizations, corresponding to the fact that regions 
of several dimensions may enclose other manifolds of fewer 
dimensions and we can consider integrals over such manifolds. 
In the case of two independent variables, in addition to integrals 
over two-dimensional regions we can consider integrals along 
curves, which are one-dimensional manifolds. In the case of 
three independent variables, besides integrals throughout three- 
dimensional regions and integrals along curves, we have to con- 
sider integrals over curved surfaces, which are two-dimensional 
manifolds enclosed in three-dimensional space. These concepts 
of integrals along curves (curvilinear integrals), integrals over 
surfaces, and so on, with many straightforward applications, will 
be introduced and their mutual relations will be investigated 
in the present chapter. 


1. Line INTEGRALS 


We associate the definition of the single integral with the 
intuitive idea of area (Vol. I, Chap. II, p. 77) and arrive at the 
multiple integral by straightforward generalization to the case of 
a greater number of dimensions. On the other hand, the physical 
idea of work also leads us to the single integral (Vol. I, Chap. V, 
p. 304). If we seek to give a mathematical definition of work 
for an arbitrary field of force in space of more than one dimension, 


we obtain the curvilinear or line integral as a new generalization 
343 
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of the original concept of the integral of a function of a single 
variable. 


1. Definition of the Line Integral. Notation. 


We begin with the purely mathematical definition of the 
integral along a curve (line integral, curvilinear integral), in three- 
dimensional zyz-space. Let a sectionally smooth * curve C in 
this space be given parametrically by the equations 


v= xt), y= y(t), 2= xt), 


where, as usual, a(t), y(t), 2(¢) are continuous functions with sec- 
tionally continuous first derivatives. We consider an arc of this 
curve joining the points P, and P with co-ordinates (Zq. Yo: Zo) 
and (2, , 2) respectively and corresponding, say, to the values 
of the parameter ¢ in the interval a<it< B. Tf a continu- 
ous function f(z, y, z) is defined in any region containing this 
arc, then along the arc this function will be a function 
F (x(t), y(t), 2(é)) of the parameter t alone. In order to define, 
in analogy with the ordinary integral, a line integral of the 
function along the curve C, we divide up the arc into small 
pieces by means of the points Po, P,, P,,..., Pa, (Pa = P) and 
denote the difference of the abscisse of P, and Py by Ay 
We now form the sum 


n~1 


E flalts), y(t), at) Aer 


where ¢, can be given any value in that interval of the parameter 
which corresponds to the arc between P, and P,,,. If we let the 
number of points of subdivision increase beyond all bounds and 
assume that the length of the longest of the arcs P;Pi4, tends 
to zero, then we may expect that the above sum will tend to a 
definite limit. This limit we denote by 


[fe Ade 


and call it a line integral of the function f(x, y, 2) along the curve 
C. That this limit does exist and is actually independent of the 
* Here, as before (cf. p. 41), we say that a curve is sectionally smooth (Ger. 


stitckweise glatt) if it consists of a finite number of arcs, each one of which has 
a continuously turning tangent at each of its points, including the end-points. 
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choice of the points of division can be proved directly, just as 
we proved the existence of the ordinary integral. It can be 
proved even more simply, however, by writing the sum in the 
form 


Eflote), wt) ae) At At 


where At, denotes the increment of the parameter ¢ as we pass 
from one point of subdivision to the next. By the definition of 
the ordinary integral, in the passage to the limit the right-hand 
side tends to 


8 da: 
f fe, wo, a) Fa, 
and for the line integral we obtain the expression 
[fey ade=ffeo 9) 2 a 
F ? > ji > di t) 


which expresses the line integral as an ordinary integral with 
respect to the parameter ¢. 

The ordinary integral is a special case of the line integral, 
which arises if we take an interval of the z-axis as the path of 
integration. 

We can now define the line integrals 


[Flos 2)dy =f Flay, 2) Y at 
and 


[ fe, ¥y; ade = fe, y; aS at 


just as above. Using the right-hand side of the formule, we can 
verify the fact that the line integral depends only on the curve 
self and not on the way in which it is expressed, i.e. not on the 
choice of parameter. For if we use the continuously differentiable 
function ¢(¢) to introduce a new parameter 7 = ¢(¢) and if in 
the interval in question d¢4(é)/dé > 0, then we have a one-to-one 
transformation of the parameter interval into a parameter 
interval a, S 7S f,, and 


8 dz Bs dz 
[fe Y, 2) ai dt =f tte, Ys 2) zu. 
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In applications line integrals usually occur in the following 
combination. Let a(x, y, 2), b(x, y, 2), (x, y, z) be three functions 
which are continuous in a region containing C. We consider the 
sum of the three line integrals 


f a(x, y, 2) da +- f d(x, y, 2)dy + i (x, y, 2) dz, 
Qo oO oO 
which can also be written in the form 
B 
[{ada+ bay + cde} =f (at + by + c2)dt, 
0 a 


where, as before, dx/dt = ¢, and so on. We suppose that the func- 
tions a, 6, ¢ are respectively the w-, y-, and z-components of a 
vector A and that x is the position vector of the point (x, y, 2) 
of the curve. Then the quantities ¢, ¥, 2 are the components 
of the vector x = dx/di, and we can write the integrand as the 
scalar product Ax. For the line integral we thus have the 
expression 


B e 
[Atd=fAdz, 


where the meaning of the notation is obvious. 
Just as we have considered line integrals in three-dimensional 
gpace, so we can of course consider similar integrals in the plane: 


[te ydx, f ftw, vdy, f {ado + bay}. 


Moreover, these ideas can be extended to line integrals of func- 
tions of n variables. In this general case we can most simply 
define a line integral 


[fle Xe, eared » Gy) dx; 


by supposing that in n-dimensional space the quantities 
Ly, Vo, +. L, ate all given as functions of a parameter ¢ in the 
interval asSt< 8. The values 2,(t), x(t), ... , v(t) in this 
interval then correspond to a curve C in n-dimensional space. 
We then define the line integral 


f fe Lg +++ » Ly) AX, 
Q 
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by the expression 
B dz; 
f FAO, 20), «+ + Balt) SE dt. 


If we consider » functions a,, dy, ... , , of the n variables 
1, Lp) . «+ » Zp, then we can again form the general line integral 


f {ade “+ dy dity +... + Gyan} 
Qo 
and express it in vector notation in the form 
f "Axedt = fAdx, 
a oO 


where, as above, by A we mean the “ vector” with components 
(a4, Gg, +++, @) and by x the position vector of the point 
(1, Va,» + « 5 By). 


The formule for the area of a region bounded by a closed curve C 
(Vol. I, Chap. V, section 2, p. 273) provide an instance where a line integral 
occurs naturally. If the closed sectionally smooth curve C in the ay-plane 
is given by the equations « = a(t), y = y(t), the area A of the region bounded 
by the curve is given by 


B B 1 fé 
A= — f yea =f ayat = — 5 f tye — apa 


In our new terminology these are simply the line integrals 


= — fyde= fody = — 5 [iva — nay), 
o o 230 


taken round C in the direction in which the value of the parameter 
increases, 


2. Fundamental Rules. 


From the expression for line integrals in terms of ordinary 
integrals we may draw several immediate conclusions. 

The value of the line integral depends on the sense in which the 
curve O is described, and in fact is multiplied by —1 of the sense 
of description is reversed, i.e. if the curve is described from P to Po 
instead of from P, to P. The proof of this is self-evident. This 
sign property makes it always convenient to think of the curve 
C as having a definite direction; we then call it an oriented curve 
(cf. Vol. I, Chap. V, section 2, p. 268). We shall occasionally use 
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the symbol —C to denote the curve obtained by describing C 
in the reverse direction. 

If the curve C is formed’ by joining together two curves C, 
and C, described in succession (which we may indicate by writing 
C=C,+ C,), then the relation 

+f 


deals 
holds for the corresponding line integrals, the meaning of the 
notation being obvious. 

The following rule is particu- 
larly important. If we restrict 
ourselves to the case of two 
variables x, y and consider a line 
integral 


[fade + bay} 


along a closed curve C (like that 
Fig. 1 in fig. 1) within which the vector 
field a, 6 is everywhere defined 
and continuous, then the formula 


[fade + bdy} 
= | {ade + bdy} + f {adn + bdy} +... +f {ada + bdy} 


holds for every resolution of the closed region R bounded by the 
oriented curve C into similarly bounded sub-regions R,, Re, ..., Ry 
with boundary curves C,, Cy, ...,0,. Here we assume that all 
the regions are described in the same sense. To prove the state- 
ment, we notice that in the addition of the integrals on the 
right the parts which are taken over a portion of the boundary C 
add together as is required to form the integral round C, while 
every boundary curve lying within R is the common boundary of 
two sub-regions and is consequently described twice, once in each 
direction, so that the integrals along these arcs cancel one 
another. 

Exactly the same result applies to the resolution of a line 
integral along a curve C in three (or more) dimensions, provided 
that the curve forms the boundary of a portion of a surface and 
this portion is subdivided by the curves C;, C,,..., Cg. 
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A somewhat different application of this principle occurs in 
the following theorem. Let two oriented closed curves C and Cc’ 
(cf. fig. 2) be subdivided by the points A,,...,4, and Ay’,...,4y' 


Fie 2—$f,.=So-Se 


respectively, in the order of the sense of orientation, and let 
each pair of corresponding points A; and A,’ be joined by a 
curved line. If by C, we denote the closed oriented curve 
AA 447, then 


2, [lod+ bdy) = [ (ada + bdy) — f (adu + bdy). 


The proof of this theorem is immediately suggested by the 
figure. In order that it may hold, it is not necessary to assume 
that the two curves C and C’ never intersect themselves or one 
another. 

Finally, we mention an integral estimate for line integrals: 


[fade + bdy + cde} < ML, 


where M is an upper bound of +/(a?+ 6+ ¢?) on C and 
L is the length of C. The proof follows at once from the 


inequality 
dx, ,dy, d@|_ zp (@ 2 a) 6) 
agrees SVPLE+A (7) +(5) +(3)- 


which is obtained by applying Schwarz’s inequality (Vol. I, 
p. 12). 
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3. Interpretation of Line Integrals in Mechanics. 

As we have already mentioned, the line integral is closely related to 
the idea of work. If a particle moves along a curve under the influence of 
a field of force—which in general may vary from point to point—and if 
the field of force is given by the vector A with components a, b, c, the 
line integral represents the work done by the field on the particle. For, if the 
force is constant and the motion takes place in a straight line, the work 
is defined as the scalar product of the force vector and the “ displacement ” 
vector. In order to generalize this definition convincingly, we replace 
the path C by the polygon with vertices Po, P,, Py,..., Py, = P, and 
instead of the actual force we take a “ substitute force ’’ which is con- 
stant along each of these segments P;P,,,, being equal to the actual 
value of the force at the initial point P;. The work performed by this 
substitute force along the segment from P, to P,,, is 


A(X Ye 2,) Ax, -+ O(x,, Yy 24) Ayg + C25, Yer 2) Ary, 


since the displacement vector from P; to P,,, has the components 
Az,, Ay, Az, Tf we sum over the whole polygon, we obtain an expression 
which tends to the line integral as we pass to the limit n-»>0. Thus the line 
integral is actually the expression for the work done during the motion. 

Other physical interpretations of the line integral will be given later 
(cf. section 3, pp. 370-1). 


4. Integration of Total Differentials. 


A particularly important case is that in which the vector A 
with components (a, b, c) is the gradient of a potential,* i e. there 
exists a function F(a, y, z) of the co-ordinates such that 


A = grad Fk 
or 
a=F, b=Ff,, c=F, 


Although in general the value of a line integral in a vector 
field depends not only on the end-points but also on the entire 
course of the curve C, the following theorem is valid here: 

The line integral over a gradient field is equal to the difference 
between the values of the potential function at the end-points and does 
not depend on the course of C between the end-points. That is, we 
obtain the same value for all curves which join the two end- 
points and remain entirely within the region in which the potential 
function F is defined. 


*If A = grad F, then the function F is often called the potential of the 
vector field. 
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In this case the line integral takes the form 
8 
[ade + bdy + cdg} =f {Ft + Fyy + Fahd, 


and the expression in brackets on the right is simply the derivative 
dF /dt of the function F with respect to the parameter t. We 
can therefore perform the integration explicitly, and obtain on 
the right the difference of the values of F at the end point and 
the initial point of the path of integration. In this case, therefore, 
we at once have the formula 


[{ade+ bay + cde} = Fal), ¥(B), (8) — Fela), x(a), 2(a)). 


This applies e.g. to the field of force due to a gravitating 
particle, which we have already (Chap. II, section 7, p. 91) 
recognized as the gradient field of the potential 1/r. The work 
done by this gravitational force when another particle moves 
from its initial position to its final position is therefore indepen- 
dent of the path. 

The expression adx + bdy + cdz is formally identical with 
what we have (p. 66) called the total differential of the function 
F(a, y, 2), 

adx + bdy + edz= dF. 


We may therefore write our formula in the form 
[ak = F(oX(B), (2), AB) — Flea), ya), 2(a)) 


and speak of integrating the total differential adz +- bdy +- edz. 

The following fact is of fundamental importance. The state- 
ment “the integral ts independent of the path” is equivalent to 
the statement “the integral round a closed curve has the value 
zero”. For if we subdivide a closed curve by means of two 
points Py and P into two ares C and (,, the equality of the 
two line integrals taken along C and OC, from Py to P means 
exactly the same thing as the vanishing of the sum of the integral 
taken along C in the direction from Py to P and the integral 
taken along C, in the direction from P to Pg; and this sum is 
the integral taken round the closed curve. 
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5. The Main Theorem on Line Integrals. 


As we have already emphasized, it is only under very special 
conditions that a line integral is independent of the path, or, 
what is equivalent, that the line integral round a closed curve is 
zero. For example, if a closed curve C forms the boundary of a 
region of positive area, then by p. 347 the line integral Jady or 
f(ady— ydz) is not zero. The chief problem of the theory of line 
integrals is to show that the sufficient condition for independence 
of the path, given on p. 350, is also necessary, and then to express 
this necessary and sufficient condition in a convenient and useful 
form. 

We shall first investigate this question of independence of the 
path in the case of plane curves. We may add in advance that 
the results in the case of three or more variables are exactly 
analogous. 

We now make the following assumptions. Let the functions 
a(x, y) and b(«, y) (which we shall again interpret as components 
of a plane vector field A), together with their partial derivatives 
a, and b,, be continuous in a region R of the plane. The follow- 
ing theorem then holds: 

The line integral 

ie {adu + bdy} 


taken along the curve C in R is independent of the particular choice 
of the path C and ts determined solely by the initial and final points 
of the curve C, tf, and only if, adx + bdy ts the total differential 
of a function U(x, y), that is, if, and only tf, a function U(x, y) 
exists in R such that the relations 


U,=4, U,=6 


A = grad U 


hold everywhere in R. — 

We have already proved on p. 351 that this condition is 
sufficient, i.e. that from this it does actually follow that the 
integral is independent of the path. 

It is easy to see that the condition is necessary. If the integral 
is independent of the path, then for a fixed initial point P, of C 
it is a (one-valued) function U(é, 7) of the co-ordinates (€, ) of 
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the end-point P. U(€, 7) is differentiable with respect to ¢ and », 
and in fact for every interior point of R we have 


Ug, 1) = lim 5{U(E +h. 2) — UE, 9} 
bot 
= lim 1 [fade + bay}—f {ada + bdy}] 


ee. 
= lim - dz bd * 
Jim 5 Sf + bay} 


Here C is any sectionally smooth curve whatever joining Py to 
the point P in R, and C, is a sectionally smooth curve in R joining 
P to the point P, with co-ordinates (€-+ h, 7). Since for suffi- 
ciently small values of h the line-segment PP, belongs to R, this 
segment can be taken as the path of integration C,. Then the 
parametric representation <= t, y=, StS &+h of this 
curve OC, gives 


. lL pith 
OE, 0) = jim | fat, a)dt= ale, 2). 
Similarly, we find that: 
. lL prte 
U,(6, 1) = lima 5 f° O(6, 0) dt = B¢ 2). 


Hence it is actually true that U,(z, y) =a, U,(x, y) = 6, as 
was stated. This result, which has so far been proved only for 
interior points of R, holds on the boundary also, in virtue of the 
continuity of all our functions. 

The above theorem, however, is of no great value, since as 
yet we have no general way of finding whether the vector field 
A is a gradient field or not. Instead of the gradient character of 
the vector field, we therefore attempt to state some other condition 
referring only to the functions a and b themselves. This is given 
in the following main theorem: 

If R is a simply-connected (open) region, a necessary and at the 
same time a sufficient condition that the integral f (adx + bdy) 

0 


shall be independent of the path C joining two given points in R 
is that the “‘ condition of integrability ” 
ay = 6, 
13 (8912) 
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is satisfied for all points of R. For a fixed initial point of C the 
integral ff (adx + bdy) then represents a function U(E, ») of the 
6] 


co-ordinates (&, n) of the end-point, and the vector field A 1s the 
gradient field of this function U, which may therefore be called 
the potential of the field. 

That the condition is necessary follows from the theorem 
which we first stated and proved. For by this theorem, 
if the integral is independent of the path, a function 
U(z, y) exists in R for which U,=aand U,=b, Since the 
derivatives 

Oye => a,(z, y) and Uay = b,(a, y) 


are continuous, by Chap. II, section 3 (p. 55) the equation 
Usy = Uye holds, and therefore 


a,(x, y) == b,(2, y); 
as stated. 

Tn order to show that the condition a, = 6, is also sufficient, 
and consequently equivalent to 
g the condition that A is a 
Pp’ P gradient, we must now use the 
assumption a, = 5, to construct 
a function U(z, y) in R such that 
U, = a(x, y) and U,= O(a, y). 
We first consider the simple case 
in which F# is a rectangle with 
4 sides parallel to the axes, given 
x 3 by the inequalities a <2 < 8, 
Fig. 3 y<y< 5. The fixed point Py 
of the region with co-ordinates 
(o, 9) is joined to the point P with co-ordinates (£, 7) by means 
of two line-segments P,P’, P’P parallel to the axes, meeting 
at the point P’ with co-ordinates (£), 4). The line PoP” is para- 
metrically represented by «= £, y= t, where 9 St S », and 
PP by = t, y=, where £, StS & (ef. fig. 3). Hence the 
integral {(adz + bdy) from Py to P taken along this pair of 

lines is given by 


[{ade + bdy} = [Ue t)dt + fiat, n) dt. 
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The function 


7 & 
U(é, 1) = f b(Eo, t)dé + f a(t, n) dt 


defined in this way is the function required. For by differentiation 
we at once have 


U;(é, 7) = al, ) 


and 


fd 
U,(é, 9) = WE 0) + i [ett nae 


Since a,(t, 7) is continuous, we may differentiate under the 
integral sign on the right: 


é 
UE, 1) = Wey 0) +f a(t, 0) dt. 
2) 
As a,(x, y) = 6,(2, y), we have 


é 
U,(é, 9) = bo, 9) + i bj(t, 4) de 


= b(Eo, n) + bg, ”)— b(Eo, n) = bg, n). 


Thus the statement about the derivatives of U(é, ) is proved, 
and from this it follows at once that the line integral is inde- 
pendent of the path. In general, therefore, 


U(E, 2) = [(ade + bay), 


where C is an arbitrary sectionally smooth curve joining P, to P 
and lying in the rectangle. The theorem is accordingly proved 
for the case of a rectangular region R. 

To generalize the result for any simply-connected region R 
we have merely to extend the construction of the function U 
to such a general region. We say that a two-dimensional 
open region is sumply-connected if every closed polygon within it 
can, by a continuous deformation within the region, be made 
to shrink up to a point. This pictorial idea of shrinking to a point 
can be made precise in the following way. Let the vertices of the 
polygon II be Py, Py, ..., Py with co-ordinates (x, Yq), 
(21, ¥1), - + + » (ny Yn) Tespectively. We now think of these vertices 
ag moving continuously with the time, starting at P,, P,,..., P,, 
respectively when ¢= 0 and all coming together at timet = 1 
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at one and the same point (£, 7) in R. That is, we suppose that 
there are points P,(t), P,(t),..., Pa(t), whose co-ordinates (x,(t), 
Yol(t)); (x(¢), y(t), eae 8 (2,(¢), Yn(t)) are continuous functions of 
tfor 0<¢S 1, and also that 


P,(0) = (% (0), ¥o(0)) = Po, - + - > Pa(O) = (n(0), yn(0)) = Pp 
and 


Po(l) = (@o(1); Yo(1)) = (€, 0), «++» Pa(1) = (@n(1), Yo(1)) = (€5)- 


Of course any closed polygon can be made to shrink to a point 
if we do not restrict its position in any way. The essential feature 
of our definition of a simply-connected region is that every closed 
polygon in the region can be shrunk to a point, the polygon IT(t) 
with vertices Po(t), P,(t)..., Pat), Po(t) remaining in the region 
during the whole process of shrinking, i.e. for all values of ¢ in the 
interval O<¢<1. 

It is intuitively clear that this definition agrees with that on 
p. 41. For if our region R is multiply-connected in the sense of 
p. 41, there is a “ hole ” in it, and a closed polygon in & enclosing 
this “hole” cannot be shrunk to a point without crossing the 
“hole ”, ie. without leaving R. Conversely, if there are no 
“holes” in R, any closed polygon can be shrunk to a point. 
We shall not prove this analytically, however, as the proof is 
lengthy and, moreover, we require only the definition given 
here. 

We shall see that in the generalization of our main theorem 
the limitation to a simply-connected region FR is essential. 

This generalization for any simply-connected region follows 
the same lines as the proof for rectangles, in that we again con- 
struct a function U(a, y) in the region R for which U, = a and 
U,= 0. Starting from an arbitrary point Py in R, we define 
U(a, y) by the statement 


Ula, y) =f (ade + bay), 


where the path of integration is any polygonal path in RF joining 
the point P, to the point P(z, y). If we can show that the value 
U(a, y) thus defined is independent of the particular polygonal 
path which we have chosen, then we have actually constructed 
a function which satisfies the conditions U, = a, U,= 6. 
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We therefore have merely to prove that the integral is in- 
dependent of the path, or instead, that the integral [(adx + bdy) 
round a closed polygon II containing the point Py vanishes. For 
this purpose we make II shrink to a point in R; that is, in R we 
form the polygon II(¢) with vertices P,(t), P,(é), ..., Pa(t) which 
coincides with II at t= 0 and reduces to a single point at t= 1. 
Since the “line integral” for a single point—a curve of zero 
length—clearly has the value zero, our problem is merely that of 
showing that the line integral along II(¢) remains constant as ¢ 
varies from 0 to 1; we shall then know that the integral along 
II(¢) is 0 for all values of ¢, and, in particular, that the integral 
along II is 0 for t= 0. 

Now consider any value ¢’ of ¢t. Since the polygon II(¢’) lies 
within R, we can choose a sequence of points (not necessarily 
vertices) Ay == P,(t’), Ay’, Ag’, ..., Am’ = Ag on II(é’) so close 
together that each pair A,’, A,',, lies within a rectangle R, interior 
to R. Iftis any parametric value close enough to ¢’, the polygon 
TI(é) lies so close to II(¢’) that on II(é) we can choose points 
Ay, A;,...; Am = Apo for which the segments A,'A, and A,’+,444, 
and the whole are 4,A,,, all lie in the rectangle R, Then by 
what we have already proved for rectangles, the integral round 
the closed polygonal path A,’A,',,4;,,4,4, is zero. Thus if we 
denote that polygonal path by C,, we have (cf. p. 349) 


f (adz + bdy) —f (adz +- bdy) sf (adx + bdy) = 0. 
1) m(t') i=07 g; 


For all values of ¢ close enough to ¢, therefore, the integral 
round II(¢) is equal to the integral round I(t’). Thus if we think 
of the integral round II(¢) as a function ¢(¢) of the parameter ¢, 
it follows that ¢(t) is a constant; that is, the integral round 
II(é) has the same value for every value of ¢, which is what was 
required to complete the proof of the theorem. 

Finally, we emphasize that for three or more dimensions 
an exactly analogous theorem holds and is proved in an exactly 
analogous way. We content ourselves by stating the theorem 
for three variables: 

Tf in an open region R within which any closed polygon can 
be made to shrink continuously to a point we are given a continuous 
vector field A with components a(x, y, z), b(x, y, z), c(x, y, z) and 
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continuous partial derwatives a,, az, b,, b,, C,, Cy, then a necessary 
and sufficient condition that the line integral 


f{ {ade + bdy + cdz} 
o 
may be independent of the path C in R 1s that the conditions 


Oy = by, b, = Cy, Cy = Ags 


or, in vector notation, the condition 
curl A = 0, 
shall be satisfied. 


For a fixed initial point P, the line integral is a function 
U(a, y, 2) of the co-ordinates of the end-point, and in fact 


[fade + bdy + cdz}= U(a, y, 2) — U(ap, Yo; 2), 
0 
or, in vector notation, 


“Adx = U(P) — U(P,), 


where the convenient abbreviation U(P) denotes the value of the 
function U at a point P. 


6. The Significance of Simple Connectivity. 


Throughout the above discussion it is essential that the 
region under consideration should be simply-connected. If the 
connectivity of the region were not simple, we should not be 
certain that the function U could everywhere be determined 
uniquely by integration along polygonal paths. 

We give the following example to show that in multiply- 
connected regions the conditions of integrability are not sufh- 
cient to ensure that the integral is independent of the path. 


The functions 
ae, y)= — Keg 
Y ony »y TR 


are defined and continuous for all values of z, y except z = 0, y = 0. Their 
derivatives 


22? 


1 Qy? 1 
ete @+ ee 


We = — ara t at yy 


bax, y) = 
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are also continuous, except at the origin, and satisfy the condition 
yx 


a, (x, y) = 6,(2, y) = + yt 


if we now take the integral 
f {adz + bdy} 
0 


round the circle C with centre at the origin given by x = cos#, y = sint, then 
C cannot be enclosed in a simply-connected region R in which the assump- 
tions are satisfied; for the region R we must take a ring-shaped region 
that does not contain the point (0, 0). Then 


2 ar 
[fade + bdy} =f {—sin#(—sint) + cost . cost} dt = fa = 2n, 
oO 0 ) 
and the integral round the closed curve is therefore not zero.* 


ExaMPLes 
1. Evaluate the integral 


fe sinydx + e® cosydy), 
Qo 


where C is a curve joining the points (0, 0) and (&, 7). 
2.* Evaluate the integral 
et 


e® ‘ A 
ie +H (x cosy + y siny)dy + ate (asiny — y cosy) dz) 


along a closed, curve enclosing the origin, which does not intersect itself, 


2. CONNEXION BETWEEN Line InrecRaLs aNnD Dovuste Ints- 
GRALS {N THE Puane. (THE INTEGRAL THEOREMS OF 
Gauss, STOKES, AND GREEN.) 


1. Statement and Proof of Gauss’s Theorem. 


For functions of a single independent variable one of the 
fundamental formule stating the relation between differentiation 
and integration is 


[ f@de = Fle.) — fle. 


Xo 


* We may remark in passing that the value of the integral /(adz + bdy) 
for any curve which does not intersect itself and which encloses the origin is 
the same, namely, 27. This follows immediately from the general theorem on 
subdivisions (cf. p. 349) if we subdivide the ring-shaped region between two such 
curves C and C’ into a number of simply-connected regions by cross-curves C; 
and apply the theorem to each of these. 
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An analogous formula—Gauss’s theorem—holds in two dimen- 
sions. Here again a differentiation is cancelled by an integration, 
in the sense that double integrals of the form 


[ffedady ot ff aydady 


are transformed into integrals that are only taken round the 
boundary curve C of R. We here regard the boundary C as an 
oriented curve and indicate the sense of description by means of 
a sign. Gauss’s theorem is then as follows: 

If the functions {(x, y) and g(x, y) are continuous and have 
continuous derivatives in a region R bounded by a sectionally smooth 
curve ©, then the formula 


J [Ufo 0 + gle, ydedy = [ {1 (@, 9)dy — gz, y)de} 


holds, where the integral on the right is a line integral round 
the closed boundary C of the region, taken in the positive sense 
of description, i.e. in such a way that the interior of the region 
R remains on the left as the boundary is described. 

In the proof we first restrict ourselves to the case in which 
the boundary C is cut by every line parallel to one of the axes 
in two points at most; in addition, we assume that g(x, y) is 
zero everywhere in R. Then by the results of the previous chapter, 
section 3 (p. 243), we can express the integral 


Jf fede, ydady 
as a repeated integral in the form 


J f fol y) da dy = [ay ffde, y) da, 


where y ranges over the interval to which points of R correspond 
and the integral [f,(x, y)dz is to be taken along the segments 
common to the lines y = const. and the region R. If xo(y) (fig. 4) 
denotes the point of entry and a,(y) the point of emergence of the 
parallel at the distance y from the x-axis, where 7, 2 2, then 


[fees wyda = F2(9), 9) ~ Hed 9) 


If, further, we denote the least and greatest values of y to which 
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points of 2 correspond by 7 and 7, then by integrating this 
equation with respect to y from 49 to 7, we obviously obtain 


J [ fele, y)dedy = f “felw) vdy +f “fled y)dy. 


For the special case g(z, y)== 0, however, this equation is 


equivalent to the theorem of Gauss stated above, as follows 
immediately from the definition of the line integral 


[fe nay. 


Tt is to be noted that the case in which the boundary of R contains 
portions parallel to the z-axis is included in the above. These 
portions contribute nothing to the boundary integral, for along 
every such portion the line integral J f(z, y)dy vanishes, since y 
is constant there. 

If we make use of our assumption that no parallel to the 
y-axis cuts the boundary of R in more than two points, the same 
considerations lead us to the formula 


ey 
J fale, 9) dedy = f Yale, yte)) — oe, yole))} de 
or * 


f f gut, y)dady = — f g(a, y)de. 


* The occurrence of the negative sign on the right-hand side should not 
cause surprise; the z-axis and y-axis in the plane are not exactly equivalent, 
as the z-axis is transformed into the y-axis by a positive rotation of 2/2, while 
the y-axis is transformed into the z-axis by a negative rotation of 7/2. 

3 (8912) 
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Addition of the two formule finally gives Gauss’s theorem in 
the general form 


f f (fla, y) + gle, y)ldedy = f { fle, 9) dy — g(a, y)dx} 


stated above. 

We can now extend our formula to more general regions, 
which do not possess the property of being cut by every parallel 
to the axes in two points at most. We start from the fact 
that by piecing together a finite number of regions with that 
property we can construct regions which in general do not possess 
such a property (cf. fig. 5). For each separate region Gauss’s 

theorem holds; and, on addi- 


tion, the parts of the line 
integrals along the internal 
(os connecting lines cancel one 


another in the usual way (p. 

349), since each of these is 

7 traversed twice, once in each 

direction, and we are left with 

Fig. 5.-—Non-convex region formed from Gauss’s theorem for the entire 

convex regions region. Conversely, this proves 

Gauss’s theorem for all regions 

R which can be divided into a finite number of sub-regions 

in such a way that the boundary of each of these sub-regions is 

intersected by parallels to the co-ordinate axes in not more than 

two points. We mention without proof that Gauss’s theorem 

does actually hold for any region with sectionally smooth boun- 

daries.* The proof can be obtained by a passage to the limit. 

In conclusion we remark that the condition that the region 

can be divided into a finite number of sub-regions, each of which 

is cut by every line parallel to an axis in two points at most, can 

be replaced by the following condition: the boundary of the 

region can be subdivided into a finite number of portions, each of 

which has a unique projection on the two co-ordinate axes; here, 

however, we allow the projection on one of the two axes to 

consist of a single point, i.e. we allow the boundary to contain 
portions parallel to the axes. 

* For such regions our assumption is not necessarily satisfied. For example, 


the boundary may partly consist of the curve y = «* sin 1/x, which is cut by the 
z-axis in an infinite number of points. 
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As a special application of Gauss’s theorem we deduce our previous 
formulz for the area of the region R. We put f(x, y) = x and g(a, y) = 0, 


and at once obtain 
A= facdy =f aay. 
“ +0 


for the area A. In exactly the same way, if f(z, y) = 0 and g(x, y) = y, we 
obtain 
A=— i. ydz, 
+0 


in agreement with previous results (Vol. I, p. 273). For the sign, see 
section 4, 1, below (pp. 374 ef seq.). 


2. Vector Form of Gauss’s Theorem. Stokes’s Theorem. 


Gauss’s theorem can be stated in a particularly simple way 
if we make use of the notation of vector analysis. For this 
purpose we consider the two functions f(z, y) and g(z, y) as 
the components of a plane vector field A. The integrand is 
then given, by the equation 


Salz, ¥) + 92, y) = div A, 


as the divergence of the vector A (cf. p. 91). In order to obtain 
a vector expression for the line integral on the right-hand side 
of Gauss’s theorem, we introduce the length of arc sof the boundary 
curve C; the positive sense of description is to be taken as the 
direction in which s increases. The right-hand side then becomes 


[fle No — alas ya} ds, 


where we put dx/ds = ¢ and dy/ds = 9. 

We now recall that the plane vector ¢ with z-component 
é@ and y-component y has the absolute value unity and the 
direction of the tangent, and points in the direction in which s 
increases, while the vector # with z-component ys) and y-com- 
ponent —2(s) has the absolute value unity and is perpendicular 
to the tangent, and, moreover, has the same position relative 
to the vector ¢ as the positive z-axis has relative to the positive 
y-axis.* Hence if the direction in which the length of arc increases 

* We see this from considerations of continuity; we may suppose that the 
tangent to the curve is made to coincide with the y-axis in such a way that 
the ¢-direction is the same as the direction in which y increases. Then z = 0, 


y = 1; and from this it follows that the normal vector ~ must point in the 
direction of the positive z-axis, 
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is that in which the boundary of the region is positively described, 
w is the unit vector in the 
y t direction of the outward-drawn 
normal (fig. 6). It is useful to 
notice that we can also write 
the components of the normal 
vector 7 in the form 
7) 


i= 2, a) = 


n on’ 


0 x 

Fig. 6.—Tangent and normal directions Where 0/0n denotes differentia- 
tion in the direction of the 

outward-drawn normal; * Gauss’s theorem can therefore also be 

written in the form 


| [Gort onaedy = [( Se + 93) a. 


We now see that the integrand is simply the scalar product 
Am or the normal component of the vector A. Consequently 
we obtain Gauss’s theorem in the important form 


J [div Adzdy = [ Ands = [ Ands. 
cu a 


In words: the integral of the divergence of a plane vector field over 
a closed region RB, is equal to the line integral, along the boundary, 
of the component of the vector field in the direction of the outward- 
drawn normal. 

In order to arrive at an entirely different vector interpre- 
tation of Gauss’s theorem in the plane, we first replace g(z, y) 
by —g(x, y). Gauss’s theorem then gives 


J fUtele 0) — guke, yN dandy = [gle yt + fle yb ds 


If the two functions f(x, y) and g(x, y) are again taken as com- 
ponents of a vector field A, g this time being the x-component 
and f the y-component, and if we again interpret %(s) and 9(s) 
as the components of the tangential unit vector ¢#, we see that 
the integrand on the right can be written in the form At= A,, 
where A? is the scalar product of the vectors A and #, i.e. the 


* For “ differentiation in a given direction ” see Chap. IT, section 4 (p. 62). 
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tangential component of the vector A. The integrand on the 
left we have already met with (p. 92) in forming the curl. In 
order to apply the concept of curl here we imagine the vector 
field A extended in any way in space, e.g. by taking the z-com- 
ponent everywhere equal to zero. The integrand on the left is 
then just the component of the vector curl A in the z-direction, 
so that the above equation for the plane can be written in the 
following form: 


J fount A), dxdy = [Acds. 


If by the curl of a vector field in the zy-plane we mean the 
z-component of the vector curl A, where A is any vector field 
obtaimed by extension as above, we can formulate Gauss’s 
theorem as follows: 

The integral of the curl of a plane vector field over a closed region 
as equal to the integral of the tangential component taken round 
the boundary. This statement is commonly referred to as Stokes’s 
theorem in the plane.* 

If we now make use of the vector character of the curl of a 
vector field in space and observe that the above result involves 
the components of the vector field in the zy-plane only, we can 
free Stokes’s theorem for plane regions from the restriction that 
these plane regions lie in the zy-plane. We thus arrive at the 
following more general statement of Stokes’s theorem: 


f i. (curl .A),dS = , A, ds, 


where T' is any plane region in space, bounded by the curve C, 
and (curl A), is the component of the vector curl A in the direction 
of the normal to the plane containing T. 


* We remark in passing that Gauss’s theorem or Stokes’s theorem can be 
used to give a new simple proof for the main theorem on line integrals 
(section 1, p. 353), in particular, for the fact that the condition f, = gy is suffi- 
cient to ensure that the line integral is independent of the path. We have seen 
that this independence of the path is equivalent to the vanishing of the integral 
round every closed path. If such a path is the boundary of a region R of the 
type considered, Stokes’s theorem transforms the line integral 


Sig(x, y)da + f(z, y)dy} 
+0 


into the integral of the expression f, — g, over the region; and if this expression 
vanishes, the vanishing of the line integral immediately follows. 
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3. Green’s Theorem. Integral of the Jacobian. 


Certain other integral transformations, usually known as 
Green’s theorems, are closely related to Gauss’s theorem. They 
have many applications in the theory of differential equations. 
In order to obtain these theorems we consider two functions 
u(x, y) and v(2, y), which we assume to have continuous deriva- 
tives of the first and second order in the region R. In virtue of 
the equations 
7) 7) 

a (Ug) = UgVs + Wea, ay (W0y) = UyPy TF Wyy, 


ea 
Gauss’s theorem gives the formula 
f i (gn + Ugg TF UyVy + UWy,) dady = f {uv,dy — uv, da} 
a +O 
or 


ff [are u,p,)dedy=— f [udvdedy+ f{—wy,da+ uv, dy}, 


where, as in Chap. IT (p. 93), we use the symbol 
Av = Vang + Uyy- 


This last integral formula is called Green’s (first) theorem. 
It has been proved above, subject to the assumption that the 
functions Uz, Vz, My Vy» Vex Vyy ate continuous in the closed 
region. If in addition we assume the continuity of the functions 
Une and U,,, We can in a similar way obtain the formula 


f feet uyry)dady=— ff odudady + {{—vu,de-+ vu, dy}, 


and from these two formule we obtain by subtraction the relation 
known as Green’s (second) theorem: 


f [ude — vAu)dxdy = f{(ouy — uv,)dx — (vu, — wv,) dy}. 


We can write the line integral in Green’s theorem somewhat 
differently if we recall that the derivative of a function f(«, y) 
in the direction of the outward-drawn normal to the curve is 
given by the equation 


7] ; ; 
anal y) = fey — fut, 
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provided that the direction in which s increases is that corre- 
sponding to positive description of the boundary. Thus, if in 
general we use the symbol 0/0n to denote differentiation with 
respect to the outward-drawn normal to the curve, Green’s 
theorems can be written in the form 


J f (use + Uv) dady = —f fedudedy = ~ ds 


J f(udo— odu) dedy = [ (us 5 ue 


We can also express the first form of Green’s theorem in yet 
another way, by means of the vector notation: 


J f(gadu grad v)drdy = —f fe div grad udady +fe Mas 


and 


Here the quantity under the integral sign on the left is the scalar 
product of the two gradients, gradu and gradv, and the symbol 
Au is replaced by the equivalent symbol div grad wu. 

We obtain another remarkable relation between integrals if 
we transform the double integrals of the products u,v, and u,v, 
respectively into line integrals by means of Gauss’s theorem and 
then subtract: 


f f (u,v, — U,v,) dady = if {wv, da + uv, dy}. 
+o 


This formula gives us a new insight into the nature of the 
Jacobian. As the integrand on the left we have the Jacobian 
oe ‘ We assume that the Jacobian is positive throughout the 

2Y 
region R and that the region R of the zvy-plane is mapped on a 
region R’ of the wv-plane by means of the equations 


Uu= Us, y¥), v= (2, ¥), 
the sense of description of the boundary being preserved since 
O(u, v) v) 
O(a, y) y) 
given by the line integral 


fude = [weeds + v,dy) 


+0 


> 0. Tho area of the region R, as we already know, is 
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taken round this boundary in the positive sense. Thus the in- 


tegral of the Jacobian 
i i o(u, °) an 
a(z, yn 


gives the area of the image region, and 


Jf faudo= ff eo ae ade 


Thus we have once again obtained the transformation formula 
of Chap. IV (p. 253) for the special case in which the integrand 
on the left is unity. If we divide the integral 


f i o(u, ”) ae 
a(x, y 
by the area of the region R and then let the diameter of R tend 


to zero, in other words, if we carry out a space-differentiation 


of this integral, in the limit we obtain the integrand, that is, the 
a(u, 5 The Jacobian is therefore the limit of the quotient 


of the area of the image region and the area of the original region 
as the diameter tends to zero, or, as we may say, it is the local ratio of 
areal distortion.* 


Jacobian 


4. The Transformation of Aze to Polar Co-ordinates. 

A process like that of the last sub-section enables us to trans- 
form the expression Au = uz, —+ U,y to new co-ordinates, e.g. to 
polar co-ordinates (r, ). For this purpose we use the formula 


J [oudady = Fi a 


which arises from Green’s theorem if we put v= 1. If we divide 


* Since by the mean value theorem of the integral calculus the ratio of the 
area of a region to the area of its image is given by an intermediate value of 
the Jacobian, the definition of the double integral now leads us almost at once 
to the general transformation formula 


ff [te v)dudv - ff t5e a) de dys 


the reader may work out the details for himself. For another complete proof 
of the transformation formula cf. section 3, No. 3 (p. 373). 
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both sides of this equation by the area of the region R and 
let the diameter of R tend to zero—that is, if we carry out a 
space-differentiation—in the limit we again obtain the expres- 
sion for Aw. 

In order to transform Au to other co-ordinates, we therefore 
have only to apply the corresponding transformation to the simple 


line integral f = ds, divide by the area, and perform a passage 
0 


to the limit. The advantage over the direct calculation is that we 
need not carry out the somewhat complicated calculation of the 
second derivatives of u, since only the first derivatives occur in 
the line integral. 


As an important example we shall work out the transformation of Au 
to polar co-ordinates (r, 6). For the region R we choose a small mesh of 
the polar co-ordinate net, say that between the circles r and r +- h and 
the lines 6 and 8+ k, whose area, as we know, has the value khp, where 


e=art 3h. 
By our general discussion we then have 
1 ou 
Au = —= — ds, 
a~>o pkh J On 


or, if we calculate the line integral for our special boundary, 


O+k 

Aiea 2 ie (r+ yudle + h, 8) — rlrs ) ag 
h>0P 

k—>0 


h 


1 7th u(r, 8 + kb) — u(r, 9) a ar 
hd, kr ° 


If we use the mean value theorems, we can also write this equation in 
the form 


: 1 1 
Au =lim ~ {Pytre(Pr 61) + Uy{T 0,) + = Ugela 92)}, 
ee 


where 7,, 7, and 0,, 0, denote values of the variables r, 8 which lie between 
rand r-+ A and between 6 and 6-+ % For the limit as 4 > 0, k +0 
we at once obtain 


1 1 
Au= (1tty)p + 7a Ue. 


which is the required transformation formula, 
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3. INTERPRETATION AND APPLICATIONS OF THE INTEGRAL 
THEOREMS FOR THE PLANE 


l. Interpretation of Gauss’s Theorem. Divergence and Intensity 
of Flow. 


We shall now interpret the integral theorems given in the previous 
section in terms of the steady flow of an incompressible fluid in two dimen- 
sions. Such a flow (which of course is only an idealization of actual physical 
conditions) occurs when a fluid distributed over a plane with constant 
surface density unity moves in such a way that the state of motion, that, is, 
the velocity vector at each point, is independent of the time (which is what 
we mean by the term “ steady”). Such a flow is therefore determined by 
the field of its velocity vector v. We shall call the components of this 
velocity vector v, and v,. If we consider any curve C to which we 
arbitrarily assign a positive direction of the normal—we denote the unit 
vector in the direction of the normal by #—then the total amount of the 
fluid which passes across the curve in the positive direction of the normal 
in unit time is given by the integral 


af onds, 
oO 


if we denote * the length of arc on C by s. If the curve is closed and encloses 
a region R, and if # is the outward-drawn normal, then Gauss’s theorem 


fonds = f [div vaedy 
oO R 


states that the total amount of fluid leaving the region R in unit time is 
equal to the integral over the region of the divergence of the velocity field. 
This statement at once leads us to the intuitive interpretation of the concept 
of divergence. The line integral on the left will not in general vanish. If 
it has a positive value, the total amount of fluid in the region is decreasing; 
if it has a negative value, the amount of fluid is increasing. If the whole 
phenomenon is steady, i.e. independent of the time, so that there can be 
no increase or decrease in the amount of the fluid in the region, the sub- 
stance is necessarily being created or destroyed in the region itself. We 
say that the region encloses sources or sinks; the steady character of the 
flow is then expressed by the fact that the sources or sinks regulate the 
entry or exit of the fluid in the interior in such a way that the amount 
of fluid remains constant within each region. The total amount of fluid 
leaving the region may be called the total flow out of the region. This is 
positive or negative according as the sources or the sinks predominate. 
If we divide the total flow by the area of the region, we obtain the average 


*In order to see that the integral actually has this meaning, we first think 
of the curve as replaced by a polygon with sides of length As,, As, ... , Asn, 
assume that on each side of the polygon the velocity vector is constant, and then 
perform the usual passage to the limit from polygon to curve. 
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or mean intensity of flow. If we now let the diameter of the region tend to 
zero, that is, if we carry out a space-differentiation, we obtain in the 
limit the iniensity of flow at the point in question. Gauss’s theorem tells 
us that div v, the divergence of the velocity field, is equal to the intensity of 
flow. Gauss’s theorem accordingly leads to an intuitive interpretation of 
the hitherto purely formal concept of divergence. 

This interpretation of the divergence can also be roughly expressed in 
the following way: we think of the flow as divided into a flow in the direc. 
tion of the z-axis with velocity v, and a flow in the direction of the y-axis 
with velocity v,, and consider a rectangle with corners P,(&, 7), Po(&+ 4, 9), 
PSE, n+ k), P(E+h, n+ kh). If the velocity v, were constant along 
each of the two sides P,P, and P,P, and had the respective values 
v,(E, 4) and v,(& +h, 7) there, the total amount of fluid leaving the 
rectangle in the x-direction in unit time would be given by the difference 
ku,(E + h, n) — kv,(&, y). If we divide by hk, the area of the rectangle, 
we obtain 


»( + &, n) — 04(8, a) 
h 


The average net flow out of the region in the direction of the y-axis is 
obtained in the same way. The expression 


of& + h, n) iets v,(&, n) + v,( &, q + k) = v,(&, n) 
h k 


therefore gives an approximation to the average net flow out of the region, 
and the passage to the limit 4 — 0, & - 0 again leads to the meaning of 
the divergence given above. 

Special] interest attaches to the case of a source-free flow, that is, a flow 
in which fluid is neither created nor destroyed in the region under con- 
sideration. This type of flow is characterized by the condition 


div v = 0, 


which by Gauss’s theorem is equivalent to the condition 


fends = 0, 
0 


where the integral is taken round any closed curve. 


2. Interpretation of Stokes’s Theorem. 


Stokes’s theorem can also be interpreted in a simple way in terms of 
the flow of an incompressible fluid in two dimensions. Let the velocity of 


flow be given by the vector v with components v,, v,. The integral if v,ds 


+o 
taken round a closed curve C we shall call the circulation of the fluid along 
this curve. By Stokes’s theorem this can at once be expressed in the form 


[vas =f feutvdedy, 
0 zB 
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and this equation further shows us that the expression curlw is to be 
regarded as the specific circulation or circulation-density at a given point. 
Stokes’s theorem then states that the circulation along the curve C is 
equal to the integral of the circulation-density over the region enclosed 
by the curve. 

Here again special interest attaches to cases of flow for which the 
circulation along every closed curve is zero, so that by Stokes’s theorem 
the circulation-density vanishes everywhere. Such flows are said to be irro- 
tational, and are characterized by the equation 


curl v = 0. 


If a steady flow is both source-free and irrotational, it satisfies the two 
systems of equations 


curly = 21 _ 2 _ , 
Ou 
ov, dv 

div v= —1+4+72—09, 
Ox + oy 


These two equations, by the way. are of special interest in that they 
occur in other branches of mathematics, in particular, in the theory of 
functions of a complex variable*, thus forming the connexion between the 
latter subject and hydrodynamics. 

We shall mention yet another interpretation of Stokes’s theorem. 
If we think of v aa representing a field of force instead of a velocity field, 
the line integral 


fot a cae + 0,dy}, 


taken round any curve, closed or not, gives the work done by the field 
of force on a particle describing the curve C. If C is a closed curve which 
forms the boundary of a region R, then Stokes’s theorem states that the 
work done in describing the boundary of F is equal to the integral over R 
of the curl of the field of force. If the work done in describing a closed 
path is always to have the value zero, the equation 


lv = — =0 
curly Da 


must be true everywhere. Conversely, if this equation is true everywhere, 
it follows from Stokes’s theorem that the integral 


if v,de = f (v,da + vgdy) 
+0 


+0 


vanishes everywhere (cf. p. 365, footnote). 


* Cf. Chapter VIII, pp. 532, 550. 
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This result shows, in accordance with section 1, p. 358, that the work 
done is independent of the path tf, and only if, 


curl vy = 0 
throughout the region. 


3. Transformation of Double Integrals. 


As an application of Gauss’s theorem we give another method 
for deriving the transformation formula for double integrals 
(cf. Chap. IV, section 4, p. 253, and p. 368, footnote). Let us 
suppose that FR is a closed region of the zy-plane bounded by the 
curve C and that the transformation = 2(u, v), y= y(u, 2) 
gives a one-to-one mapping of R on the region R’ of the uv-plane 
bounded by the curve C’, the sense of description of the boundary 
being preserved. Let the two regions satisfy the conditions for 
the applicability of Gauss’s theorem. In order to transform the 
integral 


I=f f Sle, y)dedy 


into an integral over the region R’ we first transform it into a 
line integral round the boundary C. This line integral, being a 
simple integral, can at once be transformed into a line integral 
round C’, the boundary of R’, and the latter, by Gauss’s theorem, 
can be transformed into a double integral over R’. In order to 
carry out this process we consider any function A(z, y), obtained 
from f by indefinite integration, for which 


A,=f. 
Then by Gauss’s theorem 


I = [ fAsdedy = [ 4d. 


If in the line integral on the right we now introduce the variables 
u, v instead of «, y, i.e. if we transform it by means of the func- 
tions x(u, v) and y(u, v) into an integral along the boundary 
C’ of R’, we at once obtain 


I= f A(y,du + yydv). 


To the boundary integral on the right we apply Gauss’s theorem 
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in the reverse direction, transforming it into a double integral 
over R’: 


J (Ays)du + (Aye)do— ff Ayee— (Ayn oldudo, 


From the equations 
(AYyu)» = Au oe AYou 


(Ayo)u = Avo ++ AYuns 


and 


as well as 
Au=Artut Ayu, An=Astyt+ Ayo, 4e=f, 
we find after a short calculation that 
(AYo)u — (Ayu)o = (tuo — Zefuf, 
so that finally 
=f f facdy =f f (eae — my,)fdudo, 


as was to be proved. 


4. Surrace INTEGRALS 


The theory of integration for three independent variables 
includes not only triple integrals and line integrals but the third 
concept of the surface integral. In order to explain the latter 
we begin with some considerations of a general nature, which at 
the same time will serve to refine our previous ideas, in par- 
ticular those relating to double integrals. 


1. Oriented Regions and Integration Over Them. 


b 
We start from the ordinary integral if f(x) dx of a function f(z) 


of the independent variable z. The region of integration is the 
interval between «=a and w==6. We are necessarily led 
(Vol. I, p. 81) to the convention 


[faa =—['Fe)de, 


which we can also express in the following way: the region of 
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integration, that is, the interval R under consideration, is given 
a definite direction, or, as we say, a definite orientation. If we 
reverse the orientation, that is, if we describe the interval in the 
opposite direction, the value of the integral is multiplied by —1. 
This convention may also be expressed by the equation 


f Ste) de = — f Sed, 


where the region of integration is denoted by +C when it is 
described in the direction a > 6 and by —C when it is described 
in the direction b — a. 

Tn the case of line integrals in the plane and in space we have 
likewise seen that it is necessary to assign a definite sense of 
description to the curve along which we are integrating, and that 
if this orientation is reversed the integral is multiplied by —1. 
It is now evident that a full treatment of the case of integration 
over regions of several dimensions demands the adoption of 
analogous conventions, and that our previous definitions should 
be extended accordingly. 

In Vol. I, p. 268, we gave a definite sign to the area of a region 
R, the sign being positive or negative according as the sense of 
description of the boundary is positive or negative. A plane 
region to which we attach a definite sign in this way we call 
an oriented region (fig. 7); in accordance with what we have just 
said, we shall call it positively oriented if the sense of description 
of the boundary is positive, otherwise negatively oriented. Now 
we have represented the area of a region R by the double integral 


f f dady. If this area is to be taken as positive, we shall attach 
R 


to the region a positive sense of description of the boundary, 
and we accordingly represent the absolute value of the area 
symbolically by the expression 


Jf dety={ Al. 


If we think of the region as negatively oriented, so that its area 
is negative, we express the actual value of the area by the symbol 


f f dady, and accordingly have the definition 
—B 


Jf dedy=—| Al. 
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Again, the area is expressed * as a line integral by the formula 
|4|=—fyde = [ady. 
+0 ‘+0 


If nothing special is said to the contrary, we shall always 
take & as a positively oriented region. 


Fig. 7.—-Oriented regions 


In the same way, we now state the general definition for any 
double integral whatever: 


J [fe y) da dy =f [fe y) dx dy; 
J [fe y)dzdy = — ff fe, y) da dy. 


This definition corresponds exactly to the convention already 
adopted in the case of ordinary integrals and line integrals. 
The equations do not represent any newly-proved fact: they 
are simply definitions and are justified solely on grounds of con- 
venience. 


An example will illustrate the usefulness of this convention. We saw 
(p. 253) that in the one-to-one mapping of the region R of the zy-plane 
on a region R’ of the uv-plane the area of the region R is given in the new 
co-ordinates by the integral 


J feed ={fae D aude 


provided that the Jacobian is positive everywhere in R. We know that 


*It is useful to verify by an example that the integral — f ydz is really a 
+8 


positive number. If, for example, R is a square 0 <2 Sl, 0 Sy S1, on both 
the vertical sides we have dx = 0. The side y = 0 likewise contributes nothing 
to the line integral; and on the third side we have dz < 0 and y= 1, 
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if the Jacobian is positive, the orientation (ie. the sense of description of 
the boundary) of R and RF’ is the same, while if the Jacobian is negative, 
the regions have opposite orientations. The above formula therefore would 
not hold if the Jacobian were negative, if we considered the double integral 
without regard to the orientation. But tf remains true for the case of a 
negative Jacobian if by R we mean a (positively or negatively) oriented 
region and by R’ the oriented region which arises from FR as a result of the 
transformation. Yor if the orientation is reversed, the effect of the negative 
sign of the Jacobian is cancelled by the above convention. 


In the same way, we can now regard the general transformation 


equation 
[Je naedy =f f few) oe 7 dudv 


as valid, whether the Jacobian A 7 is positive everywhere or * 
negative everywhere in the region R, it being assumed that the 
integrals are taken as integrals over oriented regions and that 
in the mapping the oriented region & becomes the oriented region 
R’. Thus only by introducing orientation and the sign principle 
do we arrwe at transformation formule for double integrals which 
are valid without exception. 

The orientation of a region can also be defined geometrically 
without reference to the boundary in the following way. We 
first consider any point of the region whatever, and to this point 
assign a sense of rotation, which we can represent e.g. as the 
sense of description of a small circle with this point as centre. 
We now say that the region FR is oriented if such a sense of rotation 
is assigned to every point of R and if on continuous passage 
from one point to another the sense of rotation is preserved. 

By means of this remark we can now assign an orientation to 
a surface lying in xyz-space. On the surface we can first assign 
a sense of rotation to a point by surrounding it by a small curve 
lying on the surface and assigning a definite sense of description 
to this curve. If we now move the point continuously over the 
surface to any other position and along with the point move the 
oriented curve with its orientation, we assign a sense of rotation 
to every point of the surface in this way (exceptional cases will 

* The formula does not hold, however, if the Jacobian changes sign in the 


region; in this case the assumption that the mapping is one-to-one cannot be 
satisfied. 
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be discussed later). We call the surface with this sense of rotation 
an oriented surface (fig. 8). 

We can get a better grasp of this orientation of a surface in 
space as follows. A portion of a surface in space will have two 
different sides, which we can best distinguish as the positive side 
and the negative side. (Which of the two sides we call positive 
and which negative is of no intrinsic importance.) For example, 
as the positive side of the xy-plane we can take the side indi- 
cated by the positive z-axis. We now mark the positive side of 
a surface S by constructing at each point of the surface a 
vector pointing out into space on the positive side; e.g. the 
normal to the surface, if a unique normal exists at the point. If 
we think of ourselves as standing on the surface with our heads 
on the positive side, we say that the surface is positively oriented 


= 


Fig. 8.— Orientation of a curved surface 


of the orientation of the surface and the line from feet to head to- 
gether form a right-handed screw (cf. Chap. I, p. 2), or, in other 
words, if the surface together with its orientation can be con- 
tinuously deformed in such a way that it becomes the positively 
oriented xy-plane and at the same time the direction of the positive 
normal becomes the direction of the positive z-axis. Otherwise, 
we say that the surface is negatively oriented. We thus see that 
there is a natural way of determining the sign of the orienta- 
tion of a surface, provided that the two sides of the surface are 
given signs to begin with. Any difficulty which the beginner may 
find in these matters lies simply in the fact that here we are 
discussing not proofs but definitions, which are justified solely 
by their convenience in simplifying subsequent discussion. 

We must not omit to mention that curved surfaces exist to 
which vo orientation can possibly be assigned, since on them 
it is not possible to distinguish two separate sides. The simplest 
surface of this type was discovered by Mébius and is called the 
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Mobius band; it is shown in fig. 9. We can easily make such a 
surface from a long strip of paper by fastening the ends of the 
strip together after rotating one end through an angle of 180° 
from its original position. The Mébius band has the property 


Fig. 9.—The Mibius band 


that if we start from a definite point, say on the centre line of 
the band, and move along the centre line, after a complete circuit 
we come back to the same point, but on the opposite side of the 
surface.* If during this motion we carry with us a small oriented 
curve, without altering its orientation, we shall find that we 
return to the starting-point with the orientation reversed. We 
see that with such a surface we can pass from one side to the 
other without crossing the boundary, and hence that it is 
impossible to assign to the surface an orientation in the sense 
described above. Such non-orientable surfaces are definitely 
excluded from the subsequent discussion. 


* We can obtain a parametric representation for the Mdbius band as follows. 
Consider first the circle x = 2cosu, y= 2sinu. At the point of the circle 
corresponding to the value u of the parameter we construct the unit vector /, 
which starts from the point of the circle, lies in the same plane as the z-axis 
and the radius to the point, and makes the angle 4u with the positive z-axis. 
At the same point we also construct the vector —7. Thus we have a line segment 
composed of the two vectors, with length 2 and its mid-point on the circle. As 
u goes from 0 to 2m this line segment travels with wu, turning through an angle 
a, 80 that finally 7 comes to the original position of —7. It is therefore clear 
that the line segment describes a Mébius band. For each value of u the point 
on the line at the distance v from the circumference in the direction of 7 (where 
~1<vS +1) has the co-ordinates 


1 ub 

x = 2cosu + vein 5 cos w, 
« ~ U.. 

y = 2sinu + vain 5 sinu, 


z= » cos & 
y 


where 
OsSus2r 
-lsrsl. 


These equations therefore represent the Mébius band parametrically. 
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We can also express the orientation of a surface by thinking 
of the surface as represented parametrically by two parameters 
wand v. Then a definite region R of the uv-plane will be mapped 
on the surface S. If in the region R we choose any orientation, 
the mapping transfers this orientation to the surface S, thus 
defining an orientation of the surface. 

Just as we can assign an orientation to a region in the plane 
or to a surface, we can also assign an orientation to a three- 
dimensional region. For this purpose the following convention 
is advantageous. We consider a region of space R bounded by 

a closed surface S. We take the 

side of the surface towards the 

interior of the region as the positive 

side. If we give the surface an 

orientation which with the direction 

from negative to positive across the 

surface determines a right-handed 

screw, we say that the region of 

space R is positively oriented (cf. 

fig. 10); if, on the other hand, we 

give the surface an orientation 

which with the negative to positive 

Fig. 10.—Positively oriented sphere direction determines a left-handed 

screw, we say that the region is 

negatively oriented. For example, the cube 0 < 2 <1, 

0SyS81, 0S2zS1 is positively oriented if we give its base 
in the zy-plane a positive orientation. 

For regions in space, just as for regions in the plane, it is 
convenient to assign a positive or a negative sign to the volume 
according as the region is positively or negatively oriented 
(cf. p. 376). We shall again agree that an integral taken over 
an oriented region has its sign changed if the orientation of the 
tegion is reversed: 


Sif fe y, 2) derdy de = —[ff fe y, 2) da dy de. 


The same argument as we have already developed for two 
dimensions shows again that the transformation formula 


Sf{fe y, 2) dady dz =f ff fe. y, 2) MY?) dudvdw 


O(u, v, w) 
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only acquires full validity when these conventions are adopted, 
O(a, ¥, 2) ; 
a is 
O(u, v, w) 
negative everywhere in the region. For, as we explained for two 
dimensions in Chap. III (p. 151), a mapping of R on R’ with a 
negative Jacobian reverses the orientation. 


since it now continues to hold when the Jacobian 


2. Definition of the Integral over a Surface in Space. 


Having made these preliminary remarks, we can now give a 
general definition of the concept of surface integral. We consider 
a region of zyz-space in which the three continuous functions 
a(x, y, z), d(x, y, 2), cw, y, 2) are defined as components of a 
vector field A = A(x, y, z). We first consider a surface S which 
has a one-to-one projection on a closed region R of the xy-plane 
and is defined by an equation z= 2(z, y); we assume that this 
surface is given an orientation which is transferred by projection 
on the «y-plane to the region R. We use the letter 22’ to denote 
the unit vector in that direction normal to the surface S which 
in conjunction with the orientation of the surface forms a right- 
handed screw. We now divide the surface S into n portions * 
S,, Sg, ..., S, with areas AS,, AS,,..., AS,. The projections 
of these portions on the zy-plane form a number of sub-regions 
R, of the region R, with areas AR,, AR,, ..., AR,, and these 
regions cover the region R exactly once. We take the areas 
AS, a8 positive, and accordingly have to assign a positive or 
negative sign to the area AR, according as the projection gives 
@ positive or a negative orientation to the corresponding regions 
R or R, in the zy-plane. The areas AS,, AS, ..., AS, and 
AR,, AR,, ..., AR, are connected by an equation of the form 


AR, = g, AS, , 


where g, denotes a quantity which tends to the cosine of the 
angle (zx, y, z) between the positive normal direction 7’ and the 
positive z-axis as the diameter of the portion S, approaches 
zero. Now let (z,, y,, 2,) be a point in the v-th sub-region of the 
surface; ie. 2,== 2(2,, y,). Then if the greatest diameter of the 
portions S, (and with it the diameters of the sub-regions R,) 
tends to zero, the sum 


* In this connexion see Chap. IV, section 6 (p. 269). 
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Eolz,, y» 2)AR, = Dele, y,, 2,)9,AS, 
vel v=1 


tends to a quantity which we denote by the symbol 
f f (a, y, 2)dxdy 
8 


f f{ C(x, y, 2) cosy dS. 
8 


We call this expression the surface integral taken over the oriented 
surface S. This limit does actually exist, since we may regard 
the integral as an ordinary integral over the two-dimensional 
oriented region R, namely as the integral 


J [ededy, 


where the integrand is the function c{z, y, 2(x, y)}. 

For the generalization to which we now proceed and for 
applications it is essential that in this integration the region R 
should be regarded as oriented. 

If the surface S also has a one-to-one projection on the 
yz-plane or the 2z-plane, that is, if it can be represented by a 
single-valued function z= x(y, z) or y= y(z, x), we can in the 
same way define the integrals 


f f a(a, y, z)dydz = a. f a{x(y,z), 4,2} dydz = f i a(x, y, z) cosadS 
and. 


f f b(2, y, 2) dzda == f f bf, y(2,2),2}dzda = f f B(x, y,z) cos BdS, 


where R’ and R” are the oriented projections of the oriented 
surface S on the corresponding co-ordinate planes and a and B 
are the angles between the positive normal to the surface and the 
positive x- or y-axis respectively. 

Adding these expressions, we obtain the general definition of 
the surface integral taken over the surface S, 


f f {a(x, y, z)dydze + b(a, y, 2)deda + oa, y, z)daxdy} 
= fA Hi {a(x, y, z) cosa -++ b(x, y, 2) cos B + oz, y, z) cosy}d8. 


or 


VJ SURFACE INTEGRALS 383 
If by dfn’ we denote differentiation in the direction of the 


unit vector 2’ in the positive normal direction,* we can also write 


Ox 
cosa= =, cosp= 


ey bad 
On” On” on” 
and consequently we can express the surface integral in the form 


flo +6 + of ai} as. 


If a, b, c are the components of a vector A, the quantity 
in brackets under the integral sign is the component of the 
vector A in the direction of the positive normal to the surface, 
which we can also write in the form Az’ or Ay. 

Incidentally, if we think of the surface as given parametrically 
by the equations z = 2(u, v), y = y(u, v), 2 = 2(u, v), where the 
oriented surface S corresponds to the oriented region B in the 
uv-plane, we can write the surface integral in the form 


BY, 2) 1 aa, y, 2) 2) 4. ote, y, 2) } 

Jf fate wa TEE + Ble, ys 2) eS + oles y 2) eB | dude 
and thus once again express it as an ordinary integral, namely, 
as a double integral over B. 

It is now easy to get rid of the special assumptions about the 
position of the surface S relative to the co-ordinate planes. We 
assume that the oriented surface S can be divided by a finite 
number of smooth arcs of curves into a finite number of por- 
tions S,, S,, ... in such a way that each portion satisfies the 
assumptions made above. The exceptional case in which a 
portion of the surface S or the whole surface S is normal to 
a co-ordinate plane, so that its projection on that plane is only 
a curve instead of a two-dimensional region, can be dealt with 
by disregarding this projection in the formation of the integral, 
since a double integral vanishes when the region of integration 
shrinks down to a curve. We can now form the surface integral 
for each of the portions S according to the above definition, and 
we can define the integral over the oriented surface S as the sum 
of the integrals thus defined. 

If, for example, the surface S is a closed surface, a sphere, 
say, we recognize that the projections of the various portions S, 


cosy = 


* The letter n’ is used here for the positive normal because n has been used 
for the outward-drawn normal in two dimensions. 
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lie partly above one another and have opposite orientations. If 
the parametric representation x= 2(u, v), y= y(u, v), 2 = 2(u, 0) 
gives a one-to-one mapping of a bounded surface S on an oriented 
region B of the uv-plane, the parametric expression given above 
for the surface integral is always valid; if we make use of this 
parametric expression in defining the surface integral, there is 
no need to subdivide the surface S. 


3. Physical Interpretation of Surface Integrals. 


The concept of surface integral can also be interpreted intuitively in 
terms of the steady flow of an incompressible fluid (this time in three 
dimensions), whose density we take as unity. Let the vector A be the 
velocity vector of this flow; then at each point of a surface S the product 
An’ gives the component of the velocity of flow in the direction of the 
positive normal to the surface; the expression 


An'S, = AS, {a(2,, Yr 2) COB a, + B( 2,5 Yr» Z,) C08 By + C(2,, Yr» 2,) COB Y,} 


is therefore approximately equal to the amount of fluid which flows in 
unit time across the element of surface S from the negative side of the 
surface to the positive side (this quantity may of course be negative). 
The surface integral 


f fide 4 bdedz + edxdy} = f f4nas 
8 


therefore represents the total amount of fluid flowing across the surface 8 
from the negative side to the positive in unit time. We notice here that 
an important part is played in the mathematical description of the motion 
of fluid by the distinction between the positive and negative sides of a 
surface, ie. by the introduction of orientation. 

In other physical applications the vector A denotes the force, due to a 
field, acting at a point (x, y, z). The direction of the vector A then gives the 
direction of the lines of force and its absolute value gives the magnitude 
of the force. In this interpretation the integral 


J [reduae + bdedx + cdedy} 
8s 


is called the total flux of force across the surface from the negative side 
to the positive. 


5. Gauss’s Turorem anp Green’s THEOREM IN SPACE 


1. Gauss’s Theorem and its Physical Interpretation. 


By means of the concept of a surface integral we can extend 
Gauss’s theorem, which we proved in section 2 (p. 360) for two 
dimensions, to three dimensions. The essential point in the 
statement of Gauss’s theorem in two dimensions is that an integral 
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taken over a plane region is reduced to a line integral taken round 
the boundary of the region. We now consider a closed three- 
dimensional region R in 2yz-space and assume—as always—that 
its boundary surface S can be divided into a finite number of 
portions with continuously turning tangent planes. In addition, 
we assume to begin with that each line parallel to a co-ordinate 
axis which has internal points in common with R cuts the 
boundary of R in exactly two points; this last assumption will 
be removed later. 

Let the three functions a(z, y, z), b(x, y, 2), c(x, y, 2), together 
with their first partial derivatives, be continuous in the region R 
and on its boundary; we take them to be the components of a 
vector field A = A(z, y, z). We now consider the integral 


Sf [POY avdyde 


taken over the region R. We suppose that the region R is pro- 
jected on the zy-plane; we thus obtain a region B in that plane. 
If we erect the normal to the xy-plane at a point (x, y) of B and 
if we denote the z-co-ordinates of its point of entrance and point 
of exit by z= z,(z, y), = 2%(2, y) respectively, we can transform 
the volume integral over R by means of the formula 


ff [ facdyde= J faedy if fae. 


Since f = 0c/dz, the integration with respect to z can be carried 
out, giving 
*10¢ 
f a, = (2, ¥, 4) — O(@, Y, %) = C — Cp, 
so that 


[ff PEM? aeayae=f fo dvdy — f fegdedy. 


If we think of the surface S as positively oriented with respect to 
the region R, then the portion of the surface S consisting of the 
points of entry z= 2z,(z, y) has a positive orientation when pro- 
jected on B, while the portion z = 2,(z, y) consisting of the points 
of exit has a negative orientation. Hence the last two integrals 
combine to form one integral 


—f fe, y, 2) dady 


14 (B912) 
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taken over the whole surface S. We thus obtain the formula 


[f [PSX aedyae= —f foe. y, 2) dedy. 


This formula obviously remains valid if S contains cylindrical 
portions perpendicular to the zy-plane; for these contribute 
nothing to the surface integral, as the regions obtained by pro- 
jecting them orthogonally on to the xy-plane merely consist of 
curved lines. 

If we obtain the corresponding formule for the components 
a and b and add the three formule, we obtain the general formula 


da(x, y, 2) , Ob(z, y, 2) , Oc(z, y, 2) 
Sfhh Oa 7 oy * oz | dedyde 


= —f{ [{ae@, y, 2) dydz +- W(x, y, z)deda +- ela, y, 2)dady}, 


which is known as Gauss’s theorem. Using the notation of p. 382, 
we can also write this in the form 


Lf [eot+2,+0,)dedyde=— f { (acosa+ bcos B+ coos y)d8. 


Here the surface is to be positively oriented with respect to R; 
a, B, y are accordingly the angles which the inward-drawn normal 
n’ makes with the positive co-ordinate axes. 

This formula can easily be extended to more general regions. 
We have only to require that the region R is capable of being 
subdivided by a finite number of portions of surfaces with con- 
tinuously turning tangent planes into sub-regions R,, each of 
which has the properties assumed above, in particular, is such 
that every line, parallel to an axis, having points in common with 
the interior of R, cuts the boundary of R, in only two points. 
Gauss’s theorem holds for each region R,. On adding, we obtain 
on the left a triple integral over the whole region R; on the 
right, some of the surface integrals combine to form the surface 
integral over S, while the others (namely, those taken over the 
surfaces by which R is subdivided) cancel one another, as we 
have already seen in the case of the plane (pp. 348, 362). Finally, 
we remark that, as before (p. 362), it is sufficient to require that 
the boundary of R consists of a finite number of portions of 
surfaces, each of which has a unique projection on ali three 
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co-ordinate planes, except that cylindrical portions whose pro- 
jections are curves are again permissible. 


As a special case of Gauss’s theorem we obtain the formula for the 
volume of a region R bounded by an oriented closed surface. If, for example, 
we put a = 0, b = 0, c= z, we immediately obtain the expression 


V =f f fexdyac = — f feavdy 
R & 
for the volume. 


In the same way, we also obtain the expressions 
=—f [xdyae=— f fudede 
8 8 


As in the case of the corresponding formula in the plane, it 
is usual to express Gauss’s theorem in another form. In the 
first place, if a, b,c are the components of a vector field A, 
we can write the expression 


for the volume.* 


da , Ob , Oc 
da ay * 3 
in the abbreviated form introduced in Chap. IT, section 7 (p. 91), 


Oz Oy 


In the second place, the discussion on p. 383 enables us to express 
the surface integral as the integral of the normal component 4,, 


of the vector A in the direction of the inward-drawn normal »’. 
Thus we obtain the vector form of Gauss’s theorem, 


[f [div Adedyds = —f f An'd8 = —f fAyd8. 


* It is noteworthy that cyclical interchange of x, y, z in these expressions 
brings about no change of sign, whereas in the case of the corresponding formulz 
for the area of a two-dimensional region the formula 


A = fxdy = —fydz 
+0 +0 


shows that interchanging x and y causes a change of sign in the integral expres- 
sion. This is due to the fact that in two dimensions an interchange of the positive 
z-direction with the positive y-direction reverses the sense of rotation of the 
plane, while in three dimensions a cyclical interchange of the positive co- 
ordinate directions, that is, replacement of x by y, of y by z, and of z by 2, 
does not change a right-handed system of axes into a left-handed system. 
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In Gauss’s theorem for space, as in the case of the plane, it is 
convenient to introduce the outward-drawn normal instead of 
the positive normal sz’. We denote this normal unit vector by 
7, 80 that 

n= —n', 


and on introducing # instead of 2’ in our formule we have to 
make corresponding changes of sign. We can now express Gauss’s 
theorem in the following form: 


ff [iv Advdyde =f [4.08 =f [ Anas, 


or, if we denote the cosines of the angles which the outward- 
drawn normal # makes with the positive co-ordinate axes by 
ou Oy oz 
an’ on’ On 


[J fleet b+ eddvayiem f f(a +b + oF)as 


As in the case of the plane, we here obtain an intuitive interpretation 
of Gauss’s theorem by taking the vector A as the velocity field of a steady 
flow of an incompressible fluid of unit density. The total mass of fluid 
which in unit time flows across a small surface AS from the interior of R 
to the exterior is given approximately by the expression A,AS, where 
A,, is the component of the velocity vector A in the direction of the 
outward normal # at a point of the surface element. Accordingly, the 
total amount of fluid which flows across a surface S from the inside to 


the outside in unit time is given by the integral f f A,,dS taken over the 
8 


, We can write 


surface. In this interpretation, therefore, the right-hand side of Gauss’s 
theorem represents the total amount of fluid leaving the region R in unit 
time. This amount of fluid is transformed into the integral of the diver- 
gence throughout the interior of the region R. From this we obtain the 
intuitive interpretation of the expression div.d. Since we have taken 
the flow as incompressible and steady, that is, independent of the time, 
the total amount of fluid flowing outwards must be continuously supplied; 
that is, in the interior of the region there must be sources producing a 
(positive or negative) quantity of fluid. The surface integral on the right 
represents the total flow out of the region R; if we divide by the volume 
of the region, we obtain the average flow out of R. If we think of the 
region & as shrinking to a point, so that its diameter tends to zero, in 
other words, if we carry out a space-differentiation of the integral 


is f div.A dzdy dz, we obtain the source-intensity at the point under con- 
sideration. On the other hand, this space-differentiation gives the integrand 
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div.A at that point, and we thus see that the divergence of the vector A is 
the source-intensity of the steady incompressible flow represented by A. 

Particular interest attaches to cases of flow which are source-free, 80 
that fluid is neither created nor annihilated at any point of the region. 
A flow of this type is characterized by the fact that the equation 


da , 0b , & 
ivA=—+—+—=0 
a Ox * oy + dz 
is satisfied everywhere. It then follows that for every closed surface 8 the 


integral over S of the normal component if f A, dS has the value zero. We con- 
8 


sider two surfaces S, and &,, both 

bounded by the same oriented 

curve C in space, which together c 
enclose a simply-connected region é 

of space R, and we apply Gauss’s 

theorem to the region R. For the 

positive normal direction on the 

surface S,, however, we shall take 

the normal pointing towards the 

inside of the region R (as in fig. 11) 

instead of that towards the outside, 

so that the sense of description of 

C in conjunction with the positive Fig. 1 

normal for either surface forms a 

right-handed screw. In Gauss’s theorem, then, we must insert different 
signs for the surfaces S, and S,. We thus obtain 


f f div.A dadydz = f f A, a8 — [ [A,ds. 
zg Sy 8s 


Since, by hypothesis, the left-hand side is zero, we have 


if 4,d8 = [ A, aS. 
Sy 8s 


In words: if a flow is source-free, the same amount of fluid flows in unit 
time across any two surfaces with the same boundary curve. This amount 
of fluid, therefore, no longer depends on the choice of the surface S with 
the closed boundary curve C. It can therefore only depend on the choice 
of C, and the problem arises how the amount of fluid can be expressed 
in terms of the curve C, This question is answered in the next section 
(p. 896) by means of Stokes’s theorem. 


2. Green’s Theorem. 


Just as in the case of two independent variables (p. 366), 
Gauss’s theorem leads to some important consequences, which 
are known as Green’s theorem. 
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We arrive at these formule by applying Gauss’s theorem (in 
vector form) to a vector field A which is given in the special form 


A =ugradv 


and therefore has the components uv,, wv,, uv, Then in R we 
have 


divA = — 2 (wo.) +2 5) +2 ,, (uea)s 
and on the des 


A,= ux. 


Then if we use the familiar symbol 
Av = Vga + Vyy + Vee 


Gauss’s theorem immediately gives us Green’s theorem: 


di f fi Uda + Udy + U,0,) da dy dz 


=— ff fudvdndyde + f fu as. 


Ifwe apply the same argument to the vector field A = v gradu, 
we obtain the formula 


f us J (sste + uy + u,0,)dxdy dz 


=f f fodudcdyde + f fo as 


If we subtract this last formula from the first one, we obtain the 
second form of Green’s theorem, 


Jf [(ude—vdu)dedyde = f f ue _ 9 Zt a8. 


3. Application of Gauss’s Theorem and Green’s Theorem in Space. 
1. Transformation of Au to Polar Co-ordinates. 


If in the second form of Green’s theorem we substitute the special 
function v = 1, we obtain 


ff Audedyde = f fas 
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Just as in the plane, we can use this formula to transform Aw to polar 
co-ordinates (r, @, 6) by choosing for the region R a cell of the polar 
co-ordinate net in space between the co-ordinate surfaces r and r+ h, 
pando-+ k, 0 and 6-1. We obtain 
1 é au a : 
A = {2 ou, sin® £ (2s) ss sind)}. 
oS ae ee rn oe kano) a 

The calculations, which are analogous to those for the plane case (cf. 
p. 369), are left to the reader. 


2. Space Forces and Surface Forces, 


The forces acting in a continuum may be regarded either as space 
forces or as surface forces. The connexion between these two points of 
view is given by Gauss’s theorem. 

We content ourselves by considering a special case, namely, the force 
in a fluid of constant density, say p = 1, in which there is a pressure 
p(x, y, 2) which in general depends on the point (a, y, z). This means that 
on every surface element through the point (2, y, z) the fluid exerts a force 
which is perpendicular to the surface element and has the surface density 
p(x, y, 2). If we consider a region R bounded by the surface S and lying in 
the fluid, the volume R will be subject to a force whose total z-component 
is given by the surface integral 


x=—f fo 5 as, 


where 8x/2n is the cosine of the angle between the x-axis and the outward- 
drawn normal to the surface. In the same way, the y- and 2-components 
of the total force are given by 


oy 

Y=— °F as, 
(es 
Oz 

2=—f [v5 a. 


Gauss’s theorem now gives 


x= ~f f frededy a 
& 

Y= — ff fevdadyae, 
Rg 

Z= ~f f fradudyaz, 
Rk 


- — f f fesdpdady de 
R 


for F, the total force exerted on R. 


and we thus obtain 
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We can express this result as follows. The forces in a fluid due to a 
pressure p(z, y, z2) may on the une hand be regarded as surface forces 
(pressures) which act with density p(x, y, z) perpendicular to each surface 
element through the point (z, y, z), and on the other hand as volume 
forces, that is, as forces which act on every element of volume with volume 
density — grad p. 


EXAMPLE 
1*. Let the equations 
& = 2( Py» Pa» Ds) (i = 1, 2, 3) 
define an arbitrary “orthogonal” co-ordinate system 7, Po, p33 that is, 
Ox; 


if we put a,, = oa then the equations 
‘Pe 


41 Gq + Gy ye + G3 A25 = 0 
415, + GypAgq + 213433 = O 


@o1 4g, + Bye Qgq + Ugg Ay = 0 
are to hold. 


(a) Prove that 


A(X, Ze, Xs) La op oe 
AAP, Po» Ps) vA aes, 


Cy = OF + Oy? + ag4, 


where 


(6) Prove that 


= (c) erie Au = Ugg, + Uae, + Yee, in terms of p,, Pe, Py using 
auss’s theorem. 


(d) Express Aw in the focal co-ordinates ¢,, t,, ¢, defined in Ex. 6, p. 158. 


6. Stoxrs’s THroremM In Space 


1, Statement and Proof of the Theorem. 


In this section we shall give a discussion of Stokes’s theorem 
for any curved surface. We have already (p. 365) met with 
Stokes’s theorem in two dimensions. 

Let C be a closed sectionally smooth oriented curve in space, 
and let S be a surface, bounded by C, whose positive normal is 
continuous or sectionally continuous and in conjunction with the 
sense of description of the boundary curve forms a right-handed 
screw. Further, let B be a vector field defined in a neighbour- 
hood of S, with components 4(z, y, z), (x, y, 2), x(%, y, 2). 
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Stokes’s theorem then states that 
f f(carlB),ds = [ Bids, 
oO 


where the arc s of the curve C increases in the direction in which 
C is described, and B, is the tangential component of B along C. 
Written in full, Stokes’s formula is 


Ox op a 2 op ag 
LE[(E- Bases (B-Z) aeaes (GB) aoa] 
=| (edn + pdy + dz). 


This transforms a surface integral taken over the oriented 
surface S into a line integral taken round the correspondingly 
oriented boundary of the surface. 

The truth of Stokes’s theorem can immediately be made 
plausible by the following train of thought. The theorem has 
already been proved for a plane surface (p. 365). Then if S is a 
polyhedral surface composed of plane polygonal surfaces, so that 
the boundary curve C is a polygon, we can apply Stokes’s theorem 
to each of the plane portions and add the corresponding formule. 
In this process the line integrals along all the interior edges of the 
polyhedra cancel, and we at once obtain Stokes’s theorem for 
the polyhedral surface. In order to obtain the general statement 
of Stokes’s theorem we have only to perform a passage to the 
limit, leading from polyhedra to arbitrary surfaces S and to 
arbitrary sectionally smooth boundary curves C. 

The rigorous performance of this passage to the limit, how- 
ever, would be troublesome; having made these heuristic re- 
marks, therefore, we shall carry out the proof by means of a simple 
calculation. 

If for brevity we put 

A = curl B, 


the components of A are given by 


ox 2 ad 8 a 
ale, y 2) = 5X — b(e, y, 2) = SP — &, ole, y, 2) = % — e 


and (cf. p. 93) 
div A == div curl B= 0. 
14¢ (2912) 
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We take the oriented surface S bounded by the oriented 
curve C and consider the problem of changing the integral 


f [4-48 =f [(adyde+ bdzdz + cdady) 


taken over S into an expression depending only on the boundary 
curve C. To do this, we imagine the surface represented in the 
usual way by two parameters wu, v, so that the surface corre- 
sponds to a closed region D in the wv-plane. By the general 
rule, the transformation of the surface integral to the region D 
gives the expression 


f [fadyae + bdedz + cdady} 
=ff (2-2) dy dz 2 Oy te (#- 9) (22 Ox Ox Hs) 
J Jo\\dy Gz] \Qudv dude dz dudv dudv 


oY _ st) On Oy oy =") dudw 
du dv Bu av ; 


We can transform the expression on the right by collecting the 
terms involving ¢, those in %, and those in x. For the terms 
involving ¢, for example, we obtain 


__ 06 (du dy Oy ea\ Oh (dude oz x). 


dy \Gudv au dv a \dudv du dv 


If to this we add the expression 


__ o¢ (= or (Oe =), 


Oz \du dv Bu dv 


which is identically zero, the terms involving ¢ in the integrand 
are 
Oa (Op dx , Ody , Ofdz\ Ae (APdu , Of dy 24) 
bo Ge oe oye babu) Du Nae Oe oy ae dz dv 


_ dp de a6 de 


du dv Ov Ou’ 
In the same way we obtain the two other terms 
Cys Oy Oe Oy ana Ox Oz Ox dz 


du dv dv Ou du dv dv Ou 
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in the integrand. The double integral is therefore split up into 
the sum of the integrals of the three expressions 


Od, z) Ap, y) Ax, 2) 
O(u, v)’ O(u, v)’ O(u, v)’ 


taken over the oriented region D, whose boundary curve K has 
an orientation corresponding to that of C. Now by Stokes’s 
theorem for two dimensions (cf. p. 364) we have 


Op Ox Og a) ( Ox Ox ) da 
dudv = d — dv) = -- ds, 

i, 5u dv dv bu) i: er OT tag . ee 
where the integrals are to be taken with corresponding orienta- 
tions and the length of arc s on C increases in the direction in 
which the curve is positively described. If we add this formula 


to the two other corresponding ones, we obtain on the left the 
value of the surface integral and on the right the integral 


dx dy dz 
[(eG+ oes xi) a. 


The expression ¢ a xo = +x — however, is just the tangential 


component B, of the vector B in the direction of the oriented 
boundary curve C, and we thus obtain Stokes’s theorem 


f ge (curl B), dS = yf B,ds 


or, oe out in full, 


SING oe ay dz +(#-2 Ox) deda n+ (- 5) aedy} 


== [ (pda + dy + x dz). 


This formula is true provided that the vector A — curl B is con- 
tinuous in the region under consideration and that the surface S 
consists of one or more portions each of which can be continuously 
represented as above by parametric equations r= zu, »), 
y = y(u, v), 2 == 2(u, v) with continuous first derivatives. 
Stokes’s theorem gives the answer to the question raised at 
the end of No. 1 of the preceding section (p. 389). We have seen 
that for a vector field whose divergence is identically zero the 
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integral of the normal component over a surface bounded by a 
fixed curve C depends on the boundary curve C only and not on 
the particular nature of the surface. Since, as we shall prove 
in section 2 of the Appendix (p. 404), every vector field A whose 
divergence is identically zero has the form 


A= curl B, 


Stokes’s theorem enables us to express the surface integral in a 
form which depends only on the boundary. 


2. Interpretation of Stokes’s Theorem. 


The physical interpretation of Stokes’s theorem in three dimensions 
is similar to that already given (p. 371) for Stokes’s theorem in two dimen- 
sions.* Once again we interpret the vector field B as the velocity field of 


a steady flow of an incompressible fluid, and we call the integral if B,ds 
0 


taken round a closed curve C the circulation of the flow along this curve. 
Stokes’s theorem states that the circulation round a curve is equal to the 
surface integral of the component of the curl in the direction of the positive 
norma] to any surface bounded by the oriented curve, the orientation of 
the surface being given by that of the boundary curve. Suppose that we 
apply Stokes’s theorem to a portion of a surface S with a continuously 
turning tangent plane. If we divide this surface integral by the area of the 
portion of surface and then perform a passage to the limit by letting the 
portion of surface and its boundary curve shrink to a point while remaining 
on the large surface S, on the left this process of space-differentiation gives 
us the component of the curl in the direction of the normal at that point 
of the surface to which the boundary curve C has shrunk. We therefore 
see that the component of the curl in the direction of the positive normal 
to the surface is to be regarded as the specific circulation or circulation- 
density of the flow in the surface at the corresponding point, where the 
sense of the circulation and the positive normal together form a right- 
handed screw.t 

If we interpret the vector B as the field of a mechanical or electrical 
force, the line integral on the right-hand side of Stokes’s theorem represents 
the work done by the field on a particle subject to the force when it is 
made to describe the curve C. By Stokes’s theorem the expression for 
this work is transformed into an integral over the surface S bounded by 
the curve, the integrand being the normal component of the curl of the 
field of force. 


* The student should note that in two dimensions Gauss’s theorem and 
Stokes’s theorem differ from one another formally by a sign only, while in 
three dimensions both the intuitive interpretation and the formal nature of the 
two theorems are essentially different. 

+ These considerations also show that the curl of a vector has a meaning 
independent of the co-ordinate system and therefore is itself a vector. 
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From Stokes’s theorem we obtain a new proof for the main 
theorem on line integrals in space (cf. also p. 365, footnote). 
The chief question was, what must be the nature of the vector 
field B if the integral of the tangential component of the vector 
taken round an arbitrary closed curve is to vanish? Stokes’s 
theorem yields a new proof of the fact that the vanishing * of 
this line integral is ensured if the curl of the vector field vanishes. 
The vanishing of the curl or, as we shall say, the irrotational nature 
of a vector field is therefore a sufficient condition—and, as we 
know from section 1 (p. 358), also a necessary one—that the 
line integral of the tangential component of the vector round 
any closed curve shall vanish. In this case the vector field B 
can itself, as we know from section 1 (p. 352), be represented as 
the gradient of a function f(z, y, 2): 

B= grad/f. 
If the vector field B is not only irrotational but also source-free, 


that is, if its divergence vanishes, then the function f satisfies 
the equation 


div grad f = 0, 
or, in full, 
es ace Mes oe 
Ay Ou? a oy? a a2 e 


For the scalar quantity f, which as before we call the potential of 
the vector B, we have Laplace’s equation 


Af = 0, 
which we have already met with (p. 93). 


7. Tar CONNEXION BETWEEN DIFFERENTIATION AND 
INTEGRATION FOR SEVERAL VARIABLES 


It is useful to reconsider, from a single point of view, the facts 
developed in this chapter. 

In the case of one independent variable we regard the reci- 
procal relation between differentiation and integration as the 

* Here, of course, we assume that a surface of the type described above and 
bounded by this curve exists. Since this may lead to difficulties or compli- 


cations—for example in the case of curves with multiple points—the proof of 
the theorem given in section 1 (p. 352) is preferable. 
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fundamental theorem of the differential and integral calculus 
(Vol. I, Chap. II, p. 117). For one independent variable this 
fundamental theorem is as follows: if f(z) is a continuous function 
in the closed region a = ¢ S 6 and if F(z) is a primitive of f(z), 
then 


[fede = FO) — Fea) 


conversely, for every function F(x) with a continuous derivative 
we can construct the corresponding function f(x) = F’(x) in 
the above formula. In the present connexion the essential 
point is the first part of the fundamental theorem, that is, the 
transformation of an integral over a one-dimensional region into 
the expression F(b)-— F(a) depending only on the boundary 
points, which form, as we may say, a region of zero dimensions. 
In other words, if the integrand is given as the derivative of a 
function F(x), the one-dimensional integral can be transformed 
by means of the function F(x) into an expression depending on 
the boundary only. 

The various integral theorems for regions in several dimen- 
sions now give us something analogous to the fundamental 
theorem for one independent variable. The point in question is 
always that of transforming an integral over a certain region lying 
in the region of the independent variables, no matter whether 
this region of integration is a curve, a surface, or a portion of 
space, into an expression that depends only on the boundary of 
the region. For example, Gauss’s theorem in two dimensions is 


f [a ale b,)dedy = f (ady — bd). 


This states that if the integrand of an integral f f ST (a, y)dudy 
g& 


over a closed region & is represented in the form 
F (2, y) = a,(2, y) + b,(z, y), 


then the double integral over the two-dimensional region can be 
transformed into an expression depending only on the one- 
dimensional boundary, namely, into a line integral round the 
boundary curve. Thus Gauss’s theorem reduces the number of 
dimensions of the region of integration by 1. Instead of the 
boundary expression (6) — F(a) considered above, we have a 
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line integral round the boundary of the plane region. Here, of 
course, we cannot speak of a primitive function F. The single 
primitive function is here in a sense represented by the vector 
field with components a(x, y) and b(z, y). On the other hand, 
the application of Gauss’s theorem does require that the integrand 
of the double integral shall be expressed by means of the dif- 
ferentiation process, in fact, as the sum of a derivative with 
respect to 2 and a derivative with respect to y. The requirement 
that the integrand f shall be capable of being expressed in this 
way still allows a great deal of freedom in the choice of the 
primitive vector field (a, 6), whereas for ordinary integrands the 
primitive function F(x) is uniquely determined except for an 
arbitrary additive constant.* 

For the case n= 2, besides Gauss’s theorem and Stokes’s 
theorem, which are essentially equivalent to one another, there 
is yet another generalization of the fundamental theorem, namely 
the main theorem on line integrals (p. 352). Within the two- 
dimensional region we have a closed one-dimensional bounded 
manifold, that is, a portion of curve with two end-points, and 
the problem is that of the reduction of this line integral to an 
expression depending only on the boundary. The main theorem 
on line integrals in section 1 (p. 352) states that this reduction 
is possible if, and only if, the integrand can be represented by 
means of a primitive function U(x, y) in the form 


é grad U, 


where ¢ is the tangential unit vector and the integration is with 
respect to the length of arc s. The value of the integral is then 
given by the equation 


(é: 


A) 
i ¢ grad Uds == O(€, n) — U(Eq, no); 
‘@ "No, 


which obviously corresponds to the state of affairs for n = 1. 


* For a given integrand f(z, y) there are many ways of finding a pair of 
functions a(x, y) and 6(z, y) which satisfy the above equation. For example, 
we can take b(x, y) as identically zero, or as equal to an arbitrary function, and 
then determine the corresponding function a(z, y) in accordance with the 
equation a, = f — b,, choosing for a(x, y) any indefinite integral of the function 
f(x, y) — by(x, y) with respect to x, y acting as parameter. Every other vector 
field which arises by the addition of an arbitrary divergence-free field to the 
vector field found as above is likewise a primitive vector field. 
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The transformation of the line integral 
A (ada + bdy) 
o 


into a boundary expression can therefore be carried out if, and 
only if, the vector A with the components a, b can be represented 
as the gradient of a potential. By comparing this with the 
ordinary fundamental theorem, we see that instead of expressing 
the integrand as the derivative we here express the integrand by 
means of a gradient and that the part played by the primitive 
function is taken by the potential of this gradient. An essential 
difference still remains between this case and the preceding one, 
however, since it is by no means true that the integrand of 
every line integral can be expressed as a gradient in this 
way; on the contrary, this depends on the condition of integra- 
bility a, = 6,. 

When there are three independent variables the conditions 
are very similar. By Gauss’s theorem a triple integral over a 
bounded closed three-dimensional region is transformed into an 
integral over the closed boundary, which is a closed unbounded * 
two-dimensional region enclosed in three-dimensional space. The 
transformation is related to the expression of the integrand of 
the triple integral as the divergence of a vector field (a, b, c), and 
to a certain extent this vector field again plays the part of the 
primitive function. 

With regard to line integrals, the case of three independent 
variables is exactly like that of two independent variables and 
requires no further discussion. 

In the case of three independent variables, the surface in- 
tegral over a two-dimensional region, that is, a surface bounded 
by @ space-curve, occupies a position between the line integral 
and the triple integral. Here the condition for the transformation 
of an integral taken over such a surface into an expression in- 
volving the boundary only is given by Stokes’s theorem in section 
6 (p. 393). The process of differentiation by means of which 
the integrand is constructed in Stokes’s theorem amounts to 
the construction of the curl of a vector field, which here takes 
the place of the primitive function. Here again the situation 

* That is, one having no boundary curve. 


t Just as in the case of two independent variables, there are many different 
ways of constructing a primitive vector field corresponding to a given integrand. 
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resembles that in the case of the line integral. In order that the 
integrand of a surface integral 


f [lady de + bdeda + cdady) 


may be expressible as the normal component of a curl the con- 
dition a, -+ 6, -+ c, = 0 must necessarily be fulfilled. Thus the 
transformation of the surface integral into a line integral is 
not always possible. We may remark that the necessary condition 
stated above is in fact a sufficient condition also.* 

The situation is similar if there are more than three inde- 
pendent variables; we need not, however, discuss this here. 


EXAMPLES 


1. Evaluate the surface integral 
iE } 7 as 
Pp 


r) 
taken over the half of the ellipsoid = + y a = = 1 for which z is posi- 
tive, where a Fe 


1, m, » being the direction cosines of the outward-drawn normal. 
2. Evaluate the surface integral 


f faas 


taken over the sphere of radius unity with centre at the origin, where 
A = ayet + agyt + agzt + Ba,yxty? + Sagy?2* + 3agx*2", 


3*, Prove Gause’s theorem in n dimensions. That is, let B be a region 
in n-dimensional x, ...%,-space and let its boundary 8 be given by an 
equation 

Gay .. +, %_) = 9 


such that G@ <0 in B. Let the functions a,(71,...,%,), where t= 1,...,%, 
be continuously differentiable in B. Then 


jf aah Oat eet Zot) day iy 
= fof (aS + + on Be) as, 


* For the proof of this see section 2 of the Appendix, p. 404. 
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where dS is the element of surface defined in Chap. IV, p. 301, and pa Arar 
v 
are the derivatives of the co-ordinates with respect to the outward 
normal, that is, 
On, G, ; 


YS ae 


Appendix to Chapter V 


1. Remarks on Gauss’s THEOREM AND STOKES’s THEOREM 


In Chapter V we proved Gauss’s theorem and Stokes’s theorem 
by starting with multiple integrals and transforming them by 
simple integrations into boundary integrals. We can, however, 
arrive at the formal expressions of these theorems in the reverse 
way. The corresponding transformations, which in themselves are 
instructive, will be briefly discussed here. 

For example, in order to obtain Stokes’s theorem in the plane 
we. consider two fixed points P and @ in the plane, joined by a 


oe 


ao 


?P 
Fig. 12 


curve C. This curve C, whose points are represented by means 
of a parameter ¢, is supposed to be deformed in such a way that 
during the deformation from its initial position to its final position 
it sweeps out the region R simply. We make this idea analyti- 
cally precise in the following way: let the curve C, which depends 
on the parameter a, be given by the parametric equations 


c= xt, a), Y= y(t, a); tp StsSh, 


where x(t, a), y(tp, a) and x(t, a), y(t, a) are the co-ordinates of 
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the two fixed points P and Q, which are independent of a. We 
suppose that when a describes an interval ay S a S a, the curve 
describes a closed region R. We assume that the functions z(t, a), 
y(t, «) have continuous derivatives with respect to ¢ and a and 
also continuous mixed second derivatives 


ae By _ 


dae Baek 


and, moreover, that everywhere in the region R except at the 


a(x, 


points P and @ the Jacobian a, a) O(@, 9) ; is different from zero, say 


5 a 
positive. Then the region R, except for the pomts P and Q, is 
Ee in a one-to-one way on the rectangle ay S a S a, and 
tp St St, of the at-plane. 

We assume that in the closed region R we are given the two 
functions a(z, y) and (x, y) with continuous partial derivatives, 
and we consider the line integral 


Ha) =f (ole, y)de + (2, y)dy) =f (ame + by dat 


taken along the curve C, corresponding to the parameter a. 
Our object is to investigate how this integral Z(a) depends on 
the variable a. For this purpose we form the derivative 


ty 
2) =/ [(a.%, + AyYa)%¢ + (6,2, + byYa)¥e + AX gs + by,,\dt 


according to the rules for differentiating an integral with respect 
to a parameter. On integration by parts we obtain 


fat byy)dt = (a, + byl — f (oer. + bay) de 
ty 
=— f [(agtet ayy2)0, + (bate+ bay edyJ at; 


the last formula is obtained if we notice that the hypotheses 
assure that z, and y, vanish at t=¢t, and t=. It follows 
that 


ao =! [ay(y.te — Yee) + baltaye — TYa)] a, 
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that is, 
i a (a O(a, 
sas =['@ ay — b,) a(t, a at 


If we integrate this last equation with respect to a between the 
limits ay and a,, we obtain 


oy ) 2, 9) 
Hay) — Mag) =f f (ay — 0) 50 das 


or, if we introduce the independent variables x, y on the right 
instead of ¢, a, 


(ag) ~ Tay) = f [ (be — a4) dedy. 


On the left-hand side, however, we have simply the line integral 
S(adaw + bdy) taken round the boundary curve C,,— C,, of the 
region R, and thus, subject to the assumptions made, we have 
obtained Stokes’s theorem for the plane. 

The reader may be left to deduce Stokes’s theorem in three 
dimensions by the same method. Gauss’s theorem in three 
dimensions may likewise be obtained by starting from a surface 
integral over a bounded surface and deforming this surface in 
such a way that it describes a region R of space. 

It should, however, be pointed out that this way of deducing 
the integral theorems does not give exactly the same results as 
the proofs developed previously. In order to attain the same 
generality we must, e.g., show for Stokes’s theorem in two dimen- 
sions that every region R of the type considered in § 2 (p. 362) in 
the plane can be covered by a family of curves C with the 
required properties of continuity and differentiability. Such a 
proof is possible, but it is so complicated that the previous 
method of proving Stokes’s theorem remains preferable. 


2. REPRESENTATION OF A SOURCE-FREE VECTOR FIELD as A CURL 


In view of the remarks at the end of Chap. V, section 6, No. 1 
(p. 396), we shall investigate whether every source-free vector 
field, that is, every vector field A for which the expression div.A 
vanishes everywhere in a closed region R of xyz-space, can be 
represented by means of a second vector B according to the 


formula yee 
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We shall show that this is actually the case. If a(z, y, 2), 
b(z, y, 2), (x, y, 2) are the components of the vector A, the 
problem is to find a vector B with components U(x, Y, 2), VL; Ys 2), 
w(a, y, z) such that the three equations 
a= Wy—% 
b= Uz — We 
C= Vy — Uy 
are satisfied in R. For the sake of simplicity we assume that 
the region R in which the vector A is defined and satisfies the 
condition a, -+ b, + ¢,=0 is a parallelepiped. We can then 
determine the vector B in many ways, e.g. in such a way that 
its third component w(a, y, 2) vanishes everywhere. If we make 
this assumption, we obtain the equations 
a= —, 
b= u, 
C= Vy — Uy. 


The first equation is satisfied if we put 
: a(z, ¥; f)dé, 


where x and y act as parameters during the integration, and 2, 
and subsequently yg are the z- and y-co-ordinates respectively of 
an arbitrary fixed point of R. To satisfy the second equation 
we put 


u=f ba, y, Oat + ale, 9), 
where a(z, y) is a function of x and y as yet undetermined. 
In virtue of the assumption a, + b, = —c, we can now satisfy 
the third equation also. We first arrive at the equation 
o= vg — ty = —f [ade, 9, 2) + bla, y, OAL — ay(z, y), 
zo 
and thus from a, + b, = —c, we obtain the further relation 


c(z, y, 2) = 7 c,(2, y, ¢) at — a,(z, y) fica (x, y; 2) — (2, ¥ Zo) cag a,(2, Y), 


which we now use to determine the function a(z, y), putting 
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a= —¢c(n, yY; 2a), 
iy 
a=—] c(a, 7, 2) dn. 
f ak 0) dy 
The vector B defined by the functions 


uU = [b¢, yY ¢) at — [a Up Zq) dn, 


v= a a(x, y f)dé, 
w=0 


is a solution of our problem. We at once arrive at the most 
general solution by writing down the three functions 


Powe, 
Ox 
V= jee 
oy 
Wa wi @, 
Oz 


where ®(, y, z) is an arbitrary twice-continuously differentiable 
function. For we see at once that the vector B’= B+ grad® 
with components (U, V, W) satisfies our condition. Conversely, 
if B’ is any vector which satisfies the condition curl B’ = A, we 
must have curl(B’ — B)= 0. Thus the vector B’ — B is irro- 
tational, and by Chap. V, section 1 (p. 352) can be represented 
as the gradient of a function P(z, y, z), so that our statement is 
proved. 


EXAMPLES 


1. Let f(z, y) be a continuous function with continuous first and 
second derivatives. Prove that if 
Sees Fyy as Sey? + 0 


the transformation 
= Jn (2, y)s .= Sy l% y), w= —-2+ xf, (x, y) + YSy (x, y) 
has a unique inverse, which is of the form 


a= G,(u, v), Y= 9% v), Z= —w+t ug,(u, v) + v9,(u, v). 
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2. Represent the gravitational vector field 
x 
VC ae 


as a curl. 


z 


igs ire Ge 
Vet et ae Vere tey 


MIscELLANEOUS EXAMPLES 


1. Let 9, a, and 6 be continuously differentiable functions of a para- 
meter ¢, for 0St<2zx, with a(2m)=a(0), b(2x)=0(0), p(2x) = (0) + 2x 
(n a rational integer), and let x, y be constants. Interpreting the equa- 
tions 

—E=zcosp—ysing+a, n= 2 sing+ y cosp+ 6b 


as the parametric equations (with parameter ¢) of a closed plane curve I’, 
prove that 


bf (Edn — ndi) = Ale + y") + Bet Cy + D, 


where 
A=if dg, B= [(acose + bsing)de, 
T T 


e=f(- sine beotpide,. DE tf (aa— bda). 


2*. Let a rigid plane P describe a closed motion with respect to a 
fixed plane IT with which it coincides. Every point M of P will describe 
a closed curve of IT bounding an area of algebraic value S(21). Denote by 
Qnx (n a rational integer) the total rotation of P with respect to II. Prove 
the following results: 

(a) If x + 0, there is in P a point C such that for any other point M 
of P we have 


S(M) = mn CM? + (C); 


(8) If n = 0, then two cases may arise: (8,) there is in P an oriented 
line A such that for every point M of P 


S(M) = »d(M), 


where d(M) is the distance of M from A and 1 is a constant positive factor; 
or else (8) S(J/) has the same value for all the points VM of the plane P. 
(Steiner’s theorem.) 


$*. A rigid line-segment AB describes in a plane II one closed motion 
of a connecting-rod: B describes a closed counter-clockwise circular motion 
with centre C, while A describes a (closed) rectilinear motion on a line 
passing through C. Apply the results of the previous example to de- 
termine the area of the closed curve in IT described by a point M which 
is rigidly connected to the line-segment AB. 
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4. The end-points A and B of a rigid line-segment AB describe one 
full turn on a closed convex curve I. A point M on AB, where AM = a, 
MB = b, describes as a result of this motion a closed curve I’. Prove 
that the area between the curves I’ and I” is equal to nab. (Holditch’s 
theorem.) 


5*, Prove that if we apply to each element ds of a twisted, closed, and 
rigid curve I’ a force of magnitude ds/p in the direction of the principal 
normal vector (p. 86), the curve [ remains in equilibrium; 1/p is the cur- 
vature of I at ds and is supposed to be finite and continuous at every 
point of I. (By the principles of the statics of a rigid body we have to prove 


that 
[%a=0, [as =0, 
re Tr e 


where denotes the unit principal normal vector of T at ds, and x is the 
position vector of ds.) 


6. Prove that a closed rigid surface 2 remains in equilibrium under a 
uniform inward pressure on all its surface-elements. (If by +’ we denote 
the inward-drawn unit vector normal to the surface-element do and by x 
the position vector of da, the statement becomes equivalent to the vector 


equations 
ff n'do = 0, ff [xr’]do = 0.) 
z 53 


7*. A rigid body of volume V bounded by the surface 2 is completely 
immersed in a fluid of specific gravity unity. Prove that the statical 
effect of the fluid pressure on the body is the same as that of a single 
force f of magnitude V, vertically upwards, applied at the centroid C of 
the volume /. 


8*, Let p denote the distance from the centre of the ellipsoid Z 
e/a? + y/o? + 2/8 = 1 


to the tangent plane at the point P(x, y, z), and dS the element of 
area at this point. Prove the relations 


af f pas = 4nabe, aff, ; dS = Be (b%c® + c®a® + a®2), 


9. An ordinary plane angle is measured by the length of the are which 
its sides intercept on a unit circle with centre at the vertex. This idea can 
be extended to a solid angle bounded by a conical surface with vertex A 
as follows, The magnitude of the solid angle is by definition equal to the 
area which it intercepts on a unit sphere with centre A. Thus the measure 
of the solid angle of the domain x20, y20, 220 is 4n/8 = 1/2. 
Now let I be a closed curve, & a surface bounded by I, and A 
a fixed point outside both I’ and x. An element of area dS at a point 
M of © defines an elementary cone with ite vertex at A, and the solid 
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angle of this cone is readily found by an elementary argument to be 
0088 ag 

r 


where r = AM and 0 is the angle between the vector MA and the normal 
to £ at M. This elementary solid angle is positive or negative according 
as 6 is acute or obtuse. Interpret the surface integral 


6 
a= ff °as 
z 
geometrically as a solid angle and show that 
Q =ff (a — x)dydz + (b — y)dzdz + (c — 2)dxdy 
s fe—s®+O—y+e—a 
where (a, 6, ¢) and (x, y, z) are the Cartesian co-ordinates of A and M 
respectively. 
10. Prove, first directly and then by interpretation of the integral as 
a solid angle, that 


pe I. ore a = 2n. 


11*. Prove that the solid angle which the whole surface of the hyper- 
boloid of one sheet 
x/a? + 2/88 — A/a = 1 


subtends at its centre (0, 0, 0) is 
Nal 2 2 2 gine 
2 fi,] b? cos? + a* sin® > do 
'o NV ab? ++ b%? cos? + a*c* sin* p 
12. Show that the value of the integral 
(a — x)dydz + (b — y)dzdx + (c — z)dudy 
2 [aa + O— y+ 2 
is independent of the choice of the surface %, provided its boundary 
T is kept fixed. By integrating over the outside of the surface, 


prove from this result that if X is a closed surface, then Q = 4x or 0, 
according as A(a, b, c) is within the volume bounded by = or outside this 


volume. 
13%. Let the surface & be bounded by the closed curve I’ and consider 
the integral 


a= 


21a, b, ) ={fe — a)dydz+ (b— a +(e 2)dady 


(7 = (a — a} + (6— y® + (€— 2"), 
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as a function of a, 6, c. Prove that the components of the gradient of QO 
can be expressed as line-integrals as follows: 


0Q_ ff (e—c)\dy—(y—b)dz 02 (xv — c)dz — (2— c)dx 
ie eer cee wa 


> 


P= a ro 
a _ ¢ (y— b)de — (w— a)dy 
ac iE ad . 


(These formule, which have an important interpretation in electro- 
magnetism, can be expressed. by the following vector equation 
[x . dx] 


ad = — 
gr fering 


where x is the vector with components (7 — a), (y — b), (z — c).) 
14*. Verify that the expression 


— 4aydz + 2(22 — y* — 1)dy 
(2 + y? — 1)? + 4y? 


is the total differential of the angle which the segment —1 S# <1, y= 0, 
subtends at the point (x, y). Using this fact, prove the following result 
by a geometrical argument: 

Let I be an oriented closed curve in the zy-plane, not passing through 
either of the points (—1, 0), (1, 0). Let p be the number of times T' 
crosses the line-segment —1 << «<1, y= 0 from the upper half-plane 
y > 0 to the lower half-plane y <0, and » the number of times [ 
crosses this line-segment from y <0 to y>0. Then 


= fay dx + (2° — y* ~ 1)dy _ = 
a car” ela 


Thus if [' is the curve r= 2 cos26 (0 3 6 S 2), in polar co-ordinates, 
o=0. 
15 **, Consider the unit circle C 
x’ = cos, y =sing, 2°?=0 (0S 9S 2n) 


in the zy-plane. Denote by Q the solid angle which the circular disc 
2+ y? <1, z= 0, subtends at the point P = (a, y, z). Now let P de- 
scribe an oriented closed curve I’ which does not meet the circle C. Let 
p be the number of times I crosses the circular disc 22 + y2 <1, 2=0, 
from the upper half-space z > 0 to the lower half-space z <0, and n 
the number of times I crosses this disc from z <0 toz > 0. If P starts 
from a point Py on T with = Q,, then P, describing T (while Q varies 
continuously with P), will return to Py with a value Q= Q,. Prove by 
a geometrical argument that 


2 — Q5 = [dO = 4n(p— n), 


Vv) EXAMPLES qn 


Using the vector equation found above, 
Aas f {PP’ dP’ 
Bree dG | PP’ ig 
(example 13), prove that 
a —ax dx da’ 
f a : ‘—y dy dy’ 
=, y = 
af | PP’ 2a—2z dz dz’ 
f f (x! —2)(dy dz’—dzdy’) + (y’—y)(dz da’ —da dz’) + (2’—2)(dady’— dy da’) 
nae [(a’ —2)*-+(y’—y)*+ (2’—2)*P” 
= 4r(p ~ n). 
(This repeated line-integral, which is due to Gauss, gives the number of 
times I’ is wound around @. It should be remarked that its vanishing is 


Fig. 13 


necessary if the two curves I’ and C (thought of as being two strings) are 
to be separable, but not sufficient, as is shown by the example in fig. 13, 
where p = n = 1, yet I’ and C cannot be separated.) 


CHAPTER VI 


Differential Equations 


We have already discussed special cases of differential equa- 
tions in Vol. I, Chap. XI. We cannot attempt to develop the 
general theory in detail within the scope of this book. In this 
chapter, however, starting with further examples from mechanics, 
we shall give at least a sketch of the main principles of the subject. 


1, Tae DirrerentiaL Equations oF THE MOTION OF A 
PaRTICLE IN THREE DIMENSIONS 


1. The Equations of Motion. 


In Vol. I, Chap. V, sections 4, 5 (p. 292), and Chap. XI (p. 502), 
we have already discussed the motion of a particle; we made 
the assumption, however, that this motion takes place along a 
pre-assigned fixed curve. We now drop this restriction and consider 
a mass m which we suppose concentrated at a point with co- 
ordinates (x, y, z). The position vector from the origin to the 
particle has components 2, y, z and we denote it by x. A motion 
of the particle will then be represented mathematically if we can 
find an expression for 2, y, 2 or x as a function of the time ¢. 
If, as before, we denote differentiation with respect to the time 
t by a dot, then the vector # with components (¢, #, 2) and 
absolute value v = +/(£? + 9? -+ 2*) represents the velocity and 
the vector # with the components #, 7, Z represents the accelera- 
tion of the particle. 

We shall not deal with the foundations of mechanics, but 
take the following definitions and facts as our starting-point: 
we call the product of the acceleration vector * and m the force 
vector f, and accordingly write 

mst =f. 
412 
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The components of this force vector, or, as we briefly say, of the 
force, will be denoted by 


mé= X, 
mij = Y, 
mi = Z. 


These three equations are known as Newton’s fundamental equa- 
tions of mechanics. From our preliminary point of view they 
represent nothing but a pure definition of the word force. It 
turns out, however, that in many cases this force vector can 
be determined without reference to the particular motion to 
be studied, a force field in space being previously known from 
physical assumptions. We can then regard the fundamental 
equations from quite a different point of view. They then represent 
conditions which must be satisfied by the acceleration in every 
particular motion if this motion takes place under the influence of 
the given field of force. 

One example of such a field of force is the field of gravity. If we take 
gravity as acting in the direction of the negative z-axis, we know the com- 
ponents of the force to begin with. They are : 

X=0, Y=0, Z=—mg, 
or, in vector notation, 
= —mg gradz, 
where g is the constant acceleration due to gravity (cf. Vol. I, Chap. V, 
section 4, p. 294). 

Another example is given by the field of force produced by a mass pu 
concentrated at the origin of the co-ordinate system and attracting accord- 
ing to Newton’s law. If r= V(x? + y?-+ 2) =| | ia the distance of 
the particle (a, y, z) with mass m from the origin, then in this case the field 
of force is given by the expression 


S= yumy grad > 
(cf. p. 91), and Newton’s fundamental equations are 
% = py grad * 
or, in components, - . 
z= — ERY ay 
g=—ur% 


a 

ll 

| 

= 

-< 
4s 
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In general, if f£ is a given field of force, with components 
X(a,y,2), Y(z, y, 2), Z(x, y, 2) which are functions of position, 
the equations of motion 


mit = f 
or 

mé= X, 

mj= Y, 

mi = Z 


form a system of three differential equations for the three unknown 
functions 2(t), y(é), 2(t). The fundamental problem of the me- 
chanics of a particle is that of the determination of the actual 
path of the particle from the differential equations, when at the 
beginning of the motion, say at the time ¢ = 0, the position of 
the particle (that is, the co-ordinates r= 2(0), Y= y(9), 
%y = 2(0)) and the initial velocity (that is, the quantities 4, = <(0), 
Yo = (0), 29 = 2(0)) are given. The problem of finding three 
functions which satisfy these initial conditions and also satisfy 
the three differential equations for all values of t is known as the 
problem of the solution or integration * of the system of differential 
equations. 


2. The Principle of the Conservation of Energy. 

Before we consider the integration of this system of dif- 
ferential equations in special cases, we shall state a number of 
general facts following from the equations of motion. The concept 
of the work done on the particle by the field of force during the 
motion was mentioned earlier (Chap. V, section 1, p. 350); we 
know that this work is given by the line integral 


f[ seat = [(Xde + Ydy + Zdz) 
taken along the path described by the particle. 
If the field of force can be represented as the gradient of a 
potential, say 
S= grad®, 


the work done during the motion is independent of the path 


* This word is used because the solution of such differential equations may 
to a certain extent be regarded as a generalization of the process of ordinary 


integration. 
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and depends only on the initial and final points of the path (cf. 
Chap. V, section 1, p. 350). A field of force which can be repre- 
sented as the gradient of a potential is called, following Helmholtz, 
a conservative * field of force. In such a field of force the equations 
of motion may be written in the vector form 


mx = —grad U, 


where instead of the potential ©, which, it may be pointed out, 
is incompletely determined in that it contains an arbitrary 
additive constant, we introduce the potential energy U = —®. 
In terms of the components the last equation becomes 


ME = —U,, 
my = —U,, 
méi= —U,. 


Although in general we cannot integrate this system of equations, 
we can deduce another equation from it in which the second 
derivatives do not occur and only the first derivatives of the 
functions x(), y(t), z(t) appear. If we use the vector notation, 
the argument may be carried out as follows. In the equation 
md = —grad U, we form the scalar product of both sides and x. 
The left-hand side then becomes the derivative of the expression 
1m? = 1mv* with respect to ¢, the right-hand side is the deri- 
vative of the function —U with respect to ¢ (cf. p. 71), and by 
integration we therefore obtain 


dmx? = —U + ¢, 
where c is a constant, that is, a quantity independent of the 
time ¢. If we wish to avoid using vector analysis, then we may 
arrive at the same result by multiplying the three equations of 


motion by ¢, y, 2 respectively and adding; on the left-hand side 
we then have the derivative of the quantity 


gm(e + 9 + 2) 
with respect to ¢. The equation 
im(e + P+ #)-+ U0 =e 


* “ Conservative ” in virtue of the theorem of the conservation of energy 
which we shall shortly deduce. 
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thus found is the mathematical expression of the theorem of the 
conservation of energy. We call the expression 


T = Ym(a2 + 9 + 22) = Jmot 


the kinetic energy (or energy of motion) of the moving particle, 
and the quantity U the potential energy (or energy of position) 
of the particle. Without going into the physical explanation of 
these concepts, we may mention that our equation has the 
following meaning: 

In the case of motion in a conservative field of force the total 
energy, that is, the sum of the potential energy and the kinetic 
energy, remains constant. 

The way in which this theorem can be used in the actual 
solution of the equations of motion will be shown in the examples 
in the next section. 


3. Equilibrium. Stability. 


The equations of motion, in conjunction with the assumption 
that f= —grad U, ie. that the field of force is conservative, now 
enable us to discuss the problem of equilibrium. We say that 
the particle is in equilibrium under the influence of the field of 
force if it remains at rest. In order that this may be the case 
its velocity and its acceleration must both be zero throughout the 
interval of time under consideration. The equations of motion 
therefore give the equations 

grad U = 0 
or 
U,=0, U,=0, U,=0 


as the necessary conditions for equilibrium. 

These same equations determine the points at which the poten- 
tial energy U has a stationary value. It is particularly interesting 
to find that a point at which the potential energy U has a proper 
minimum is a point of stable equilibrium. By stability of equili- 
brium we mean that if we slightly disturb the state of equilibrium 
the whole resulting motion will differ only slightly from the state 
of rest.* More precisely, let R and p be any positive numbers. 

* An example is given by a particle which rests under the influence of gravity 
at the lowest point of a spherical bowl which is concave upwards. On the 
other hand, a particle resting at the highest point of a spherical bowl which is 


concave downwards is in “ unstable” equilibrium; the slightest disturbance 
results in a large change of position. 
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Corresponding to R and p we can find two positive numbers € 
and § so small that if the particle is moved a distance not more 
than ¢ from the position of equilibrium and started off with a 
velocity not greater than 5, then in the whole subsequent course 
of the motion the point never reaches a distance greater than R 
from the point of equilibrium and never has a velocity greater 
than p. 

It is a remarkable fact that we can prove this statement about 
stability without integrating the equations of motion. In the 
proof we need only use the assumption that at the position of 
equilibrium in question the potential energy U has a proper 
minimum. For simplicity we assume that the position of equi- 
librium, the point where U has a minimum, is the origin; if not, 
we can make this point the origin by translation of axes. By 
definition the potential energy U involves an arbitrary additive 
constant; for the function U and the function (U + const.) give 
the same field of force, the constant disappearing in the process 
of differentiation. Thus without loss of generality we may take 
the value of the minimum U(0, 0, 0) as zero. 

About the origin we describe a sphere S, with radius r; re- 
calling the assumption that U is a minimum, we choose r< R 
so small that everywhere in the interior and on the surface 
of this sphere, except at the origin, the inequality U>0 is 
satisfied. The least value of U on the surface of the sphere we 
call a; by hypothesis, a is positive. It is therefore certain that 
the particle can never reach the surface of the sphere S, as long 
as its potential energy remains less than a. Since U is continuous, 
we can find an ¢, depending on a, so small that in the sphere 
S, with radius ¢ about the origin the value of U is at most $a. 
If we start the particle from a point of S,, and give it an initial 
velocity v, so small that for the initial kinetic energy we have 


Ty = pmo? < 3a 
(in other words, if | v9|<+/(a/m)), then by the law of the 
conservation of energy we always have 
T+U=T),+ U9 <a. 
Since T is always equal to or greater than zero, we shall always 


have U less than a, and therefore the particle can never reach a 


distance greater than r from the origin. Since U remains greater 
15 (2912) 


418 DIFFERENTIAL EQUATIONS [Cuap. 


than or equal to zero, T remains less than a throughout the whole 
motion, and for the velocity we always have v << 4/(2a/m). In 
virtue of the continuity of U, a tends to zero with r. We can 
therefore choose 7 so small that 4/(2a/m) < p (that is, a < $p*m), 
so that the velocity is always less than p. Thus if the point 
starts inside S, with velocity v9, and if | |< ~+/(a/m), it 
always remains within the sphere S, of radius r << # and always 
has a velocity less than p. 


9. EXAMPLES ON THE MECHANICS OF A PARTICLE 


1. Path of a Falling Body. 


As a first example we shall consider the motion of a particle under the 
influence of gravity, taken as acting parallel to the negative z-axis. Newton's 
equations of motion take the form 


méi=0, mj=0, mzi= —mg; 
that is, 

Ci ay i 

i ~ 0, aa 0, wae g 


From these equations by integration we find first the corresponding com- 
ponents of the velocity, and then the co-ordinates of the particle itself. 
We at once obtain 


dx dy dz 
=, =b, 2=— ; 
a °° aw” dt gt ey 


where a}, b,, ¢, are constants; a second integration gives the equations 


@ == Al + Ae, 
y = byt + by, 
z= —$gl? + et + Co, 


where a», bg, ¢, also represent constants. The meaning of the six constants 
of integration is found from the initial conditions. Without restricting the 
generality of the mechanical problem, we can choose the co-ordinates in 
such a way that at the time ¢ = 0 the particle is at the origin. Accordingly, 
if we put ¢= 0 and at the same time 2 = y = z = 0 in the last equations, 
we at once obtain a, = b, = c¢,= 0. Moreover, we can assume without 
loss of generality that the initial velocity lies in the xz-plane, so that the 
component 6, of the initial velocity has the value zero. With these assump- 
tions the equation y(t) = 0 will hold for all values of ¢. The trajectory 
(that is, the path of the particle) therefore lies in a fixed plane, namely, 
the 2z-plane. If we eliminate the time ¢ from the remaining equations 


zw=at, z= —Fgl? + et, 
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we obtain the equation of the trajectory in the form 


yee a 
z 2a,A a? + Ps a. 
This curve is a parabola, with its axis parallel to the z-axis and its vertex 
upwards, The co-ordinates of the vertex, which correspond to the maximum 
of the function z, are found by equating the derivative of the right-hand 
side of our equation to zero. For the co-ordinates (x, z) of the vertex we 
thus obtain the values 


a ee i 
2a? goa g 2% 
The time T at which the highest point of the path is reached is determined 
by the equation 


After twice this time, that is, ¢ = 2c,/g, the mass has reached the point 
with co-ordinates z = 2a,c,/g and z == 0, and thus lies on the horizontal 
line y = z = 0 through the initial point. 


2. Small Oscillations about a Position of Equilibrium. 


In section 1, No. 3 (p. 416) we considered the question of the 
stability of equilibrium. The motion of a particle about a position 
of stable equilibrium, corresponding to a minimum of the potential 
energy, can be approximated to in a simple way. For the sake 
of brevity we restrict ourselves to a motion in the zy-plane and 
assume that there is no force acting in the direction of the 
z-axis. We imagine the potential energy in the neighbourhood 
of the origin (which we take at the minimum) expanded by 
Taylor’s theorem in the form 


U= Uy + pot gy + 3(aa* + ay + cy?) +... . 


Here p, g and a, 6, ¢ denote the values of the derivatives 
U,, U, and U,,, Usy, Uy, respectively at the origin. In virtue 
of the assumption U, = 0, and since U,(0, 0) = 0, U,(0, 0) = 0, 
the constant term and the linear terms in this expansion dis- 
appear. We now assume that, corresponding to the fact that 
the origin is a minimum, the quadratic terms 


Q(z, y) = ¥(ax* + bay + cy?) 
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form a positively definite quadratic form (p. 205), and that in a 
sufficiently small neighbourhood of the position of equilibrium the 
potential energy U can be replaced with sufficient accuracy by 
this quadratic form Q. With these assumptions the equations 
of motion take the form 


mx = —gradQ 
or 

me = —ax — by, 

my = —ba — cy. 


These can easily be integrated completely if we first rotate the 
x- and y-axes through a suitably chosen angle. For if we consider 
the positively definite form ax* +- 2bxy + cy? = 2Q, we know 
from elementary analytical geometry that by rotating the axes 
through a suitably chosen angle ¢, that is, by making the sub- 
stitution 

z= €cosd — 7 sing, 

y = Esing + 7 0084, 


this expression can be transformed into an expression of the form 
ag? + Bn? = 2Q, 


where ¢ and 7 are the new rectangular co-ordinates and a and 8 
are positive numbers.* In these new co-ordinates the equations 


of motion m% = —gradQ transform into 
mé = —ak, 
mi = — Bn, 


where , 7 are the new components of the position vector x. 
As in Vol. I, Chap. V, section 4 (pp. 296-7), both these equations 
can be integrated completely. We obtain 


f= Aysing/* ¢— a), 
m 
n= Again ae (t — e), 
m 
where ¢,, ¢,, 4,, A, are constants of integration which enable us 


* For the equation Q = 1 represents an ellipse, and by suitable choice of ¢ 
the term in zy can be removed. 
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to make the motion satisfy any arbitrarily assigned initial con- 
ditions. 

The form of the solution shows that the motion about a 
position of stable equilibrium results from the superposition of 
simple harmonic oscillations in the two “ principal directions ee 
the &-direction and the 7-direction, the frequencies of these 
oscillations being given by +/(a/m) and +/(B/m). A general dis- 
cussion of these oscillations, which we shall not carry out here, 
shows that the resultant motion may take a great variety of 
forms. 

24 r 


? 7 


Figs. 1~3.—Oscillation diagrams 


To give a few examples of these compound oscillations we first consider 
the motion represented by the equations 


&=sin(t + ¢), 


7 = sin(é — c). 
By eliminating the time ¢ we obtain the equation 
(& + 7)? sin?c + (E — ny)? cos?e = 4 sinc cos*e, 


which represents an ellipse. The two components of the oscillation have the 
same frequency 1 and the same amplitude 1, but a difference of phase 2c. 
If this difference of phase successively takes all values between 0 and 7/4, 
the corresponding ellipse passes from the degenerate straight-line case 
—&—7= 0 to the circle &2-+ ny? = 1, and the oscillation passes from the 
so-called linear oscillation to the circular (cf. figs. 1-3). 
If as a second example we consider the motion represented by the 

equations 

& = sint, 

1 = sin 2(t — e), 


where the frequencies are no longer equal, we obtain oscillation diagrams 
which are decidedly more complicated. In figs. 4, 5, and 6 these figures 
are given for the phase differences c = 0,c = 1/8, and c = 1/4 respectively. 
In the first two cases the particle moves continvously on a closed curve, 
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but in the last case it swings backwards and forwards on an arc of the 
parabola y= 2@?— 1. The curves obtained by the superposition of 


v4 2 4 


1 1 1 


Figs. 4~6.—Oscillation diagrams 


different simple harmonic oscillations in directions at right angles to one 
another are given the general name of Lissajous figures. 


3. Planetary Motion. 


In the examples discussed above the differential equations 
of the motion can immediately (or after a simple transformation) 
be written in such a way that each of the co-ordinates occurs 
in one differential equation only and can be determined by 
elementary integration. We shall now consider the most im- 
portant case of a motion in which the equations of motion are 
no longer separable in this simple way, so that their integration 
involves a somewhat more difficult calculation. The problem 
in question is the deduction of Kepler’s laws of planetary motion 
from Newton’s law of attraction. We suppose that at the origin 
of the co-ordinate system there is a body of mass p (e.g. the sun) 
whose gravitational field of force per unit mass is given by the 
vector 1 
S= ye grad —. 


What is the motion of a particle (a planet) under the influence 
of this field of force? The equations of motion are 


ss 1 
t= “Se 
CERIEEAIr 
Y= YP Os? 
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In order to integrate them we first state the theorem of con- 
servation of energy for the motion in the form 


dn(a? + y+ #) — HEN _ , 


where C is constant throughout the motion and is determined 
by the initial conditions. 

From the equations of motion we can now deduce other 
equations in which only the components of the velocity, not the 
acceleration, are present. If we multiply the first equation of 
motion by y, the second by x, and subtract, we obtain 


ty — nj =0, or © (dy — ya) = 0, 


whence by integration we have 
ry — yt = Cy. 
Similarly, from the remaining equations of motion we obtain * 


Ye — 24 = Co, 
2b — LE Cy. 


These equations enable us to simplify our problem very 
considerably in a way which is highly plausible from the intuitive 
point of view. Without loss of generality we can choose the 
co-ordinate system in such a way that at the beginning of the 
motion, that is, at ¢ = 0, the particle lies in the zy-plane and 
its velocity vector at that time also lies in that plane. Then 


* We can also arrive at these three equations using vector notation, if we 
form the vector product of both sides of the equation of motion and the position 
vector x. Since the force vector is in the same direction as the position vector, 
we obtain zero on the right, while the expression [x] on the left is the 
derivative of the vector [x2] with respect to the time. It therefore follows 
that this vector [xx] = c has a value which is constant in time; this is 
exactly what is stated by the co-ordinate equations above. 

As we see, this equation does not depend on our special problem, but holds 
in general for every motion in which the force has the same direction as the 
position vector. 

The vector [xx] is called the moment of velocity and the vector m[x2] 
the moment of momentum of the motion. From the geometrical meaning of the 
vector product we easily obtain the following intuitive interpretation of the 
relation just given (cf. the subsequent discussions in the text). If we project 
the moving particle on to the co-ordinate planes and in each co-ordinate plane 
consider the area which the radius vector from the origin to the point of pro- 
jection sweeps over in time #, this area is proportional to the time (theorem 
of areas). 
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2(0) = 0, and 2(0)= 0; and by substituting these values in the 
above equation and remembering that the right-hand sides are 
constants, we obtain 

xy — ye= o =h, 
yz oT zy = 0, 
2h — 2% = 0. 


From these equations we conclude in the first place that the whole 
motion takes place in the plane z = 0. Since we naturally exclude 
the possibility of a collision between the sun and planet, we may 
assume that the three co-ordinates («, y, z) do not vanish simul- 
taneously, so that at the time t= 0 at which 2(0) = 0 we have, 
say, 2(0) + 0. Now from the last of the three equations above it 
follows that 


Therefore z = az, where a is a constant. If we put ¢= 0 here, 
then from the equations 2(0) = 0 and 2(0) + 0 it follows that 
a= 0, so that z is always zero. 

We may therefore base our problem of integration on the two 
differential equations 


: « m 
ym(a + gt) — TE = 0, 
xy — ys = h. 


We next use the equations z= rcos0, y= rsin@ to transform 
the rectangular co-ordinates (x, y) into the polar co-ordinates 
(r, 6), which are now to be determined as functions of t. Since 


a2 gf? — 24 7262, 
and : 
Ly — yt = 7°78, 
we have the two differential equations 
Lani? 1262) — YE” _. Q, 
rT 
r6—=h 


for the polar co-ordinates r, 6. The first of these equations is the 
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theorem of the conservation of energy, while the second expresses 
Kepler’s law of areas. In fact (cf. Vol. I, Chap. V, section 2, 
pp. 273, 275) the expression 476 is the derivative with respect 
to the time of the area swept out in time ¢ by the radius vector 
from the origin to the particle. This is found to be constant, 
or, as Kepler expressed it, the radius vector describes equal areas 
in equal times. ; 

If the “area constant” h is zero, 8 must vanish, that is, 0 
must remain constant, so that the motion must take place on a 
straight line through the origin. We exclude this special case 
and expressly assume that h += 0. 

In order to find the geometrical form of the orbit, we give 
up thinking of the motion as a function of the time * and consider 
the angle 6 as a function of 7, or 7 as a function of 6, and from 
our two equations we calculate the derivative dr/d@ as a function 
of r. 

If we substitute the value 6 = h/r? from the area equation 
in the energy equation and recall the equation 


= Vu & 6, 
i a 
we at once obtain the differential equation of the orbit in the 
form 
m [h? (dr\? | h? yum 
2 (7 (3) | ao 
or 


=) ge et med :) 
dé mh? r py 


To simplify the later calculations we make the substitution 


: 
u 


f= 


* The course of the motion as a function of the time can be determined 
subsequently by means of the equation 


f *8d0 = A(t — &), 


in which we suppose that 7 is known as a function of @ (cf. p. 428). 
15° (8912) 
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and introduce the following abbreviations: 


Love 
pW 
2 
fa14 208 
my? 


The above differential cae then becomes 


(i) = a -(« is ae 


and this can be integrated immediately. We have 
du 


alas) V(2/p? — (u— 1]p)*) 


or if for the moment we introduce u — — v as @ new variable, 
Pp 


wo 
= |S 
; J V(e?/p® — 0) 
For the integral (by Vol. I, Chap. IV, section 2, p. 213) we obtain 
the value arc sin”, and we thus obtain the equation of the 
€ 


orbit in the form 


LoVe een(b 0.) 
r Pp Pp 

The angle @, can be chosen arbitrarily, since it is immaterial from 

which fixed line the angle 6 is measured. If we take 0) = 7/2, 

that is, if we let v = 0 correspond to the value @ = 7/2, we finally 

obtain the equation of the orbit in the form 


P 


Or TS econ: 


We shall assume that the student already knows from analytical 
geometry that this is the equation in polar co-ordinates of a 
conic having one focus at the origin. 

Our result therefore gives Kepler’s law: the planets move in 
conics with the sun at one focus. 
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It is interesting to relate the constants of integration 
h? 2Ch? 
ee= en 


to the initial motion. The quantity p is known as the semi-latus 
rectum or parameter of the conic; in the case of the ellipse and 
the hyperbola it is connected with the semi-axes a and 6 by the 
simple relation 
p= . 
The square of the eccentricity, «2, determines the character of the 
conic; it is an ellipse, a parabola, or a hyperbola, according as 
e” is less than, equal to, or greater than 1. 
From the relation 
2Ch? 
2—] pacts 
: - my? 
we see at once that the three different possibilities can also be 
stated in terms of the energy constant C; the orbit is an ellipse, 
a parabola, or a hyperbola, according as C is less than, equal to, 
or greater than zero. 


If we suppose that the particle is brought at time t= 0 to the 
point x, in the field of force and is there started off with an initial velocity 
Xo, then the reiaiion 
ym 


é 2 
C = imu, 
To 


gives the surprising fact that the character of the orbit—ellipse, parabola, 
or hyperbola——does not depend on the direction of the initial velocity at 
all, but only on its absolute value vp. 


Kepler’s third law is a simple consequence of the other two. 
It states that in elliptic orbits the square of the period bears a 
constant ratio to the cube of the major semi-axis, the ratio depending 
on the field of force only and not on the particular planet. 

If we denote the period by T and the major semi-axis by a, 


we should then have 
T2 
a = const., 


where the constant on the right is independent of the particular 
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problem and depends only on the magnitude of the attracting 
mass and on the gravitational constant y. 

To prove this we use the theorem of areas in the integrated 
form 


f 2d0 = Ait ~ te), 
8 


which defines the motion as a function of the time. If we take 
the integral over the interval from 0 to 2a, we obtain on the 
left-hand side twice the area of the orbital ellipse, that is, by 
previous results, 2aab, while on the right-hand side the time 
difference ¢ — ¢) must be replaced by the period 7. Therefore 


Qrab= AT or 4a°ab? = h?T?, 
We already know that h? is connected with the elements a and b 
of the orbit by the relation h2/yyp = p = b2/a. If we replace h? in 
2 
the above equations by ia yp, it follows at once that 
a 


which exactly expresses Kepler’s third law. 


EXAMPLES 
1. Prove that as t-» © the velocity +/%* of a planet tends to 0 if its 
orbit is a parabola and to a positive limit if it is a hyperbola. 


2*, A planet is moving on an ellipse, and w = a(t) denotes the angle 
PMP,, where P is the position of the planet at the time ¢, P, its position 
at the time ¢, when it is nearest to the sun S, and JM the centre of the ellipse. 
Prove that w and ¢ are connected by Kepler’s equation 


h(t — t,) = ab(w — esinw). 


3. Prove that a body attracted towards a centre O by a force of mag- 
nitude mr moves on an ellipse with centre O. 


4. Prove that the orbit of a body repelled by a force of magnitude 
f(r), where f is a given function, from a centre O is given in polar co- 


ordinates (7, 6) by 
a me a /(% one iy 
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5. Prove that the equation of the orbit of a body repelled with a 


u 2 
force A from a centre O is 


l cos(x@ +e) for p< ht 
htx 


1 
u f a cosh(x8 + s) for p> i 
h?x 


x=4/(l1— 5 |) and ¢ is a constant of integration. 


3. Furrser Exampies oF DirFERENTIAL EQuaTIons 


Before discussing the foundations of the theory of differential 
equations, which we shall do in the next section, we shall here 
consider some further examples of problems involving differential 
equations, also arising in part from mechanics. 


1. The General Linear Differential Equation of the First Order. 


In Vol. I, Chap. III, section 7 (pp. 178, 182) we have already 
integrated the equation y’ + ay + 6 = 0 completely in the case 
where a and 6 are constants. We can, however, also completely 
integrate this “linear differential equation of the first order ” * 


y +ay+b=0 


for the unknown function y(z) in the general case where a and b 
are any continuous functions of x. The solution is obtained by 
means of the exponential function and ordinary integration 
(which, however, cannot in general be performed in terms of 
elementary functions). 

We first suppose that b= 0. Then the differential equation 
can be put in the form 


provided that y + 0. From this it follows that 
log] y| = — fala)dz, 


* The word “linear” expresses the fact that the unknown function and 
its derivatives are only linearly involved in the differential equation. A differen- 
tial equation is said to be “ of the first order ” when it contains first derivatives 
only and no higher derivatives. 
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and finally, if for brevity we denote any indefinite integral of the 
function a(x) by A(z), 


y= ce A@), 


where c is an arbitrary constant of integration. This formula 
gives a solution even when c = 0, namely y = 0. 

If now 6(x) is not equal to zero, we attempt to find a solution 
of the form 

y = uaje-4, 

where u(x) must be suitably determined.* 

Since A’(x) = a(z), 

y = uw (ae 4 — u(x)a(x)e“4™, 
and for the unknown function u(x) we therefore have the dif- 
ferential equation 
u (ae 4 — —b, 
from which it follows that 
ua) = — ui b(x)e4 da. 
The expression 
y(a) = —e4® (B(x) de, 

where 


A(z) = f{ a(x) dx 


therefore gives a solution of the differential equation. This solution 
is formed from known functions by means of the exponential 
function and of ordinary processes of integration only. Since the 
function u(x) involves an arbitrary additive constant, we see that 
the expression 


y(2) = €4(c — [o(a)edz), 
where 


A(z) = f a(x) da, 


gives a solution which still contains an arbitrary constant of 
integration c. This solution really contains only one arbitrary 
constant, although A(x) also involves an additive constant. For 
if we replace A(x) by A(x) + ¢, the solution becomes one of 
similar type obtained from the original solution by replacing 
by ce, 


* This device is known as “ variation of the parameter ” (see also p. 445). 
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For example, in the case of the differential equation 


y¥ +a+ez=0 
we have 


A(a) = | xdx = 42%, A e4@)b(x) da = fi cer l2@ da = ex'l2, 


and hence the solution 
y = e-*2(¢ — eX"/2) = ce-*7/2 — 1, 


as we may verify by diiferentiation. 


2. Separation of the Variables. 


The idea which underlies the above solution is that of 
separation of the variables. If a differential equation is of the 
form 

1 __ (2) 


= — Fay 


where a depends on x only and f on y only, it may also be 
expressed symbolically by 
adz-+ Bdy=0 
or 
adx = —Bdy, 


in which the variables x and y are separated. Introducing the 
two indefinite integrals 


A= fade, B= — | Bdy, 
which are obtained by ordinary quadratures, we at once obtain 


d 
5 (4— B= a+ By =0, 


that is, 
A—B=ec, 


where c is an arbitrary constant of integration. This equation 
may now be imagined as solved for y, and the required solution 
is thus obtained by quadrature. 

Another example in which the same idea is applied is the 
so-called homogeneous differential equation 
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If we take z==y/z, so that y’=az’+z, the differential 
equation becomes 


az’ + 2= f(z), 
or 
== f@) x - 
x 


an equation for z in which y does not appear explicitly. Hence 


where ¢ is an arbitrary constant of integration. Using this equa- 
tion to express 2 as a function of x, we obtain the required solution. 


Examples.—From y’ == Y we at once have 
x 


dy a 
y 
the solution of which is 
log y | = 6 
x 
Again, the equation 
Pe i 
: x 
gives 
dz z—1 
: = | = ] : 
is 8 es log] 
hence 
ee 
ame ey 
where & is a constant. 
EXAMPLES 


1, Integrate the following equations by separation of the variables: 
(a) (L+ y*)eda + (1+ 2*)dy= 0. 
(b)  ye®*®da — (1 + e*)dy = 0. 
2. Solve the following homogeneous equations: 
(a) ydxz+ au — y)dy = 0. 
(6) aydu + (a? + y*)dy = 0, 
(ce) w@— y+ 2ayy’ = 0. 
(2) (w+ y)dx + (y — 2)dy = 0. 
(e) (@ + ayy’ = @ VV (a — 9%) + ay + y* 
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3. Show that a differential equation of the form 


az + wae) 
= eT Q, d,,... constant) 
ad am + by + cy outs 


can be reduced to a homogeneous equation as follows. If ab, — a,b + 0, 
we take a new unknown function and a new independent variable 


n=artbyt+e = ae+ by + 
Tf ab, — a,b = 0, we need only change the unknown function by putting 
n= ax + by 


to reduce the equation to a new equation in which the variables are sepa- 
rated. 
4, Apply the method of the previous example to 
(4) (2% + 4y + 3)y’ = 2y+ a+ 1. 
(b) (8y — 7x + 3)y’ = 38y — Ta + 7. 
5. Integrate the following linear differential equations of the first order: 


os * fe ny = eX 7 
(a) y+ ycoosr=cosrsingz. (b) y zl e*(% + 1)", 


(c) 2% — 1)y’ + (1 — 2x)jy + a? = 0. (d) yf — lyst. 


1 
“ = —_..., 
() (+ ay + y= a 
6. Integrate the equation 
1 
ee —e 
¥+y¥=es 


3. Determination of the Solution by Boundary Values. The 
Loaded Cable and the Loaded Beam. 


In the problems of mechanics and the other examples pre- 
viously discussed, we selected from the whole family of functions 
satisfying the differential equation a particular one by means of 
so-called initial conditions, that is, we chose the constants of 
integration in such a way that the solution and in certain cases 
also its derivatives up to the (n— 1)-th order assume pre- 
assigned values at a definite point. In many applications we 
are concerned neither with finding the general solution nor with 
solving definite initial-value problems, but instead with solving 
a so-called boundary-value problem. In a boundary-value problem 
we are required to find a solution which must satisfy pre-assigned 
conditions at several points and which must be considered in the 
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intervals between those points. Here we shall discuss a few 
typical examples without going into the general theory of such 
boundary-value problems. 


Ex, 1.—The Differential Equation of a Loaded Cable. 


In a vertical xy-plane—in which the y-axis is vertical—we suppose 
that a cable whose (constant) horizontal component of tension is S is 
stretched from the origin to the point 
x=a, y=b (cf. fig. 7). The cable 
is acted on by a load whose density per 
unit length of horizontal projection 
is given by a sectionally continuous 
function p(x). Then the sag y(x) of 
the cable, that is, the y-co-ordinate, 
is given by the differential equation 


y’ (x) = g(a), where gn) =F 


Fig. 7.—Loaded cable The shape of the cable will then be 

given by that solution y(x) of the 

differential equation which satisfies the conditions y(0)= 0, y(a) = b. 
The solution of this boundary-value problem can be written down at once, 
since the general solution of the homogeneous equation y’” = 0 is the 
linear function cp -++ ¢,2, and the solution of the non-homogeneous equation 
which, with its first derivative, vanishes at the origin is given by the 


Xe 
integral if. 9(&)(a — &)d& (see below, pp. 441-8). In the general solution 
0 


Yl) = 6 + oye + f "g(EN(a — Ede 


the condition y(0) = 0 at once gives cy) = 0, and then the condition y(a) = b 
gives the equation 


b= oa+ f “Ea — Ede 


for the determination of ¢,. 

In practice, besides this very simple form of boundary-value problem 
a more complicated case occurs, in which the cable is subject not only to 
the continuously-distributed load but also to concentrated loads, that is, 
loads which are concentrated at a definite point of the cable, say at the point 
x = 2%. Such concentrated loads we shall consider as ideal limiting cases 
arising as ¢« > 0 from a loading p(x) which acts only in the interval 2) — ¢ 
to a -+ « and for which 


Xo e 
i u)de = P, 


Xe—e 


that is, the total loading remains constant during the passage to the limit 
e¢ +0; the number P is then called the concentrated load acting at the 


vq] FURTHER EXAMPLES 435 


point 25. By integrating both sides of the differential equation y” = p(x) /S 
over the interval from z— ¢ to x-++ « before making the passage to the 
limit « > 0, we see that the equation y’(x) + ©) — y/(a%y — &) = P/S holds. 
If we now perform the passage to the limit « — 0, we obtain the result 
that a concentrated load P acting at the point x) corresponds to a jump 
of the derivative y’(~) by an amount P/S at the point 2». 

The following example suffices to show how the occurrence of a con- 
centrated load modifies the boundary-value problem. We suppose that 
the cable is stretched between the points = 0, y= Oandzr=l,y=1 
and that the only load is a concentrated load of magnitude P acting 
at the mid-point x= 4. According to the above discussion, this physical 
problem corresponds to the following mathematical problem: we have to 
find a continuous function y(x) which satisfies the differential equation 
y” = 0 everywhere in the interval 0 < x < 1, except at the point x = 4, 
which takes the values y(0) = 0, y(1) = 1 on the boundary, and whose 
derivative has a jump of the amount P/S at the point a». In order to 
find this solution, we express it in the following way: 


y(x)=ar-+b for OSe2Sh 
and 
y(z)=cl—2)+d for 45251. 


The condition (0) = 0, y(1) = 1 gives 6 = 0, d= I. From the condition 
that Donk parts of the function shall give the same value at the point 
2 = 4 we find that 


fa=4e4 1. 


Finally, the requirement that the derivative y’ shall increase by the amount 
P/S on passing the point 4 gives the condition 


P 
—c—a=-—. 
We therefore have the constants 
P P 
a=1— 5% b= 0, ca —l— py d@=1, 


and our solution is thus determined. Moreover, it is easy to show that 
no other solution with the same properties exists. 


Ez, 2.—The Loaded Beam.* 

The situation in the case of loaded beams is very similar (cf. fig. 8). 
Let us suppose that in its position of rest the beam coincides with the 
x-axis between the abscisse x= 0 and x= a. Then it is found that the 
sag y(x) due to a force acting vertically in the y-direction is given by the 
linear differential equation of the fourth order 


y = 9(x), 


*¥or the theory of loaded beams cf. e.g. Morley, Theory of Structures 
(Longmans, Green & Co., 1927). 
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where the right-hand side 9(z) is p(x)/HI, p(x) being the density of loading, 
E the modulus of elasticity of the material of the beam (Z is the stress 


Fig. 8.--Loaded beam 


divided by the elongation), and J the moment of inertia of the cross-section 
of the beam about a horizontal line through the centre of mass of the 
cross-section. 
The general solution of this differential equation can at once be ex- 
pressed (p. 446) in the form 
Fa (ce — Ep 
Ya) = eo + eye + oat + og? + f° 9(B) =F 


dé, 


where C9, ¢, C2, c; are arbitrary constants of integration. The real problem, 
however, is not that of finding this general solution, but that of finding 
a particular solution, i.e. of determining the constants of integration in 
such a way that certain definite boundary conditions are satisfied. If 
e.g. the beam is clamped at the ends, the boundary conditions 


y(0)= 0, y(a)=0, (0) = 0, y(a)=0 
hold. It then follows at once that cy) = ¢, = 0, and the constants c, and cy 
are to be determined from the equations 


oat + ont + [96 ea =o, 


(a— & 
er ae d&=0. 

With beams the occurrence of concentrated loads is again of particular 
interest. As before, we shall think of the concentrated load acting at the 
point x = zy as arising from a loading p(x), distributed continuously over 


Xo be 
the interval x, — ¢, to x) + «, for which f p(E)d— = P; we again let ¢ 

Xen e€ 
approach zero and at the same time let p(x) increase in such a way that 
the value of P remains constant during the passage to the limit ¢ > 0. 
P is then the value of the concentrated load at «= 2,. Just as in the 
example above, we integrate both sides of the differential equation over 
the interval from z—e to + « and then perform the passage to the 
limit «+0. It is found that the third derivative of the solution y(x) 
must have a jump at the point z = x», this jump amounting to 

P 


oq + 0) — yao — 0) = ar 


Here y(x) + 0) means the limit of y(x, + h) as A tends to zero through 


2c, + Bega? + foe) 
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positive values, y(2)— 0) being the corresponding limit from the left. 
Thus the following mathematical problem arises: we attempt to find 
a solution of y*” = 0 which, together with its first and second derivatives, 
is continuous, for which y(0) = y(1) = y/(0) = y(1) = 9, and whose third 
derivative has a jump of the amount P/EI at the point «= 7, and else- 
where is continuous. 
If the beam is fixed at a point 2 = a, (cf. fig. 9), ie. if at this point the 


Y; 


Fig. 9.--Sag of beam supported in the middle 


sag has the fixed pre-assigned value y = 0, we can think of the fixation 
as being effected by means of a concentrated load acting at that point. 
By the mechanical principle that action is equal to reaction the value of 
this concentrated load will be equal to the force which the fixed beam 
exerts on its support. The magnitude P of this force is then given at once 
by the formula 

P = Blfy’"(x9 + 9) — ¥(%o — O)}, 


where y(z) satisfies the differential equation y*? = P/HI everywhere in the 
interval 0 <2 < 1 except at the point + = 2, and in addition also satisfies 
the conditions (0) = y(1) = y'(0) = y/(1) = 0, (xo) = 0, and y, y’, and 
y”’ are also continuous at % = 2%. 

In order to illustrate these ideas we consider a beam extending from 
the point # = 0 to the point + = 1, clamped at its end-points x = 0 and 
«x = 1, carrying a uniform load of density p(a) = 1, and supported at the 
point ¢ = 4 (cf. fig. 9). For the sake of simplicity we assume that ZI = 1, 
so that the beam satisfies the differential equation 

yt? = 1 
everywhere, except at the point « = 4. 

As the formula shows, the general solution of the differential equation 
is a polynomial of the fourth degree in x, the coefficient of «4 being 1/4!. 
The solution will be expressed by a polynomial of this type in each of 
the two half-intervals, For the first half-interval we write the polynomial 
in the form 


y = bo t+ bz + bo? + byt + Tat, 
in the second half-interval, in the form 
y = eo+ ele — 1) + oe — IP + ogfe— E+ Fe UY 
Since the beam is clamped at the ends x = 0 and a = 1, it follows that 
y(0) = y(1) = (0) = y'(1) = 9 
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whence we obtain by) = b, = cy = ¢, = 0. In addition, y(zx), y/(2), (x) 
must be continuous at the point z= 1; that is, the values of yd), (3) 
y(4) calculated from the two polynomials must be the same, and the 


value of y($) must be zero. This gives 
1 1 1 1 


1 1 
“6b i ;) ne ae a — =), 
get g Pst 5a, = ge gt 35 =O 
3 1 3 1 
Mr re at 7% — Fe 


2b, + 3b, = 2c, — 3c. 
From this we obtain the following values for by, bg, C2, C3: 
1 


bg = Cg = —}; bs = —tg = — — 


96 24 


and the force which must act on the beam at the point « = 4 in order that 
no sag may occur at that point is given by 


G+) r= (4-D- (m4 =-} 
y G+o y 5 0} = (6c, 5 Osis z 
4. LingaR DirrerentiaL Equations 


1. Principle of Superposition. General Solutions. 


Many of the examples previously discussed belong to the 
general class of linear differential equations. A differential 
equation in the unknown function u(z) is said to be linear of the 
n-th order if it has the form 


ua) + aqu®— P(x) +... + aqu(z) = (2), 


where a1, dj, dg, ... , @, are given functions of the independent 
variable x, as is also the right-hand side ¢(x). The expression on 
the left-hand side we shall denote by the abbreviation L{u] 
(“ linear differential expression of the n-th order ”’). 

If 4(z) is identically zero in the interval under consideration, 
we say that the equation is homogeneous; otherwise, we say that 
it is non-homogeneous. We see at once (as in the special case of 
the linear differential equation of the second order with constant 
coefficients, discussed in Vol. I, p. 510) that the following principle 
of superposition holds: if 4, tu, are any two solutions of the 
homogeneous equation, every linear combination of them, 
U = CU, + CoM, where the coefficients c,, c, are constants, 
is also a solution. 
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If we know a single solution v(x) of the non-homogeneous 
equation L[u] = d(x), we can obtain other such solutions by 
adding to v(x) any solution of the homogeneous equation. Con- 
versely, any two solutions of the non-homogeneous equation 
differ only by a solution of the homogeneous equation. 

For.n = 2 and constant coefficients a,, ag we proved in Vol. I, 
Chap. XI (p. 508) that every solution of the homogeneous equation 
can be expressed in terms of two suitably chosen solutions w,, u, 
in the form ¢,% + Cg%_. An analogous theorem holds for any 
homogeneous differential equation with arbitrary continuous 
coefficients. 

To begin with, we explain what we mean by saying that a 
system of functions are linearly dependent or linearly indepen- 
dent, by means of the following definition: functions ¢,(z), 
f(z), ..., Pp(z) are linearly dependent if n constants q,..., Cn 
exist, which do not all vanish and which satisfy the equation 


¢9,(x) + Cobo(x) + anece + Cn Pal) =0 


identically, that is, for all values of x in the interval under con- 
sideration. Then if c, + 0, say, ¢,(x) may be expressed in 
the form 


Pnl(2) ram a4, (2) ee oe Gn1Pn—(2), 


and ¢, is said to be linearly dependent on the other functions. 
If no linear relation of the form 


Cy b4(x) + Caho(a) +... + enbalz) 
exists, the m functions ¢,(z) are said to be linearly independent. 


Ex. 1.—The functions 1, 2, 27,..., 2% 1 are linearly independent. 
Otherwise, constants ¢p, ¢,,..., Cj, would have to exist such that the 
polynomial 

Cy + yt + oe. Ht Cy yet 


vanishes for all values of 2 in a certain interval. This, however, is im- 
possible unless all the coefficients of the polynomial are zero. 

Ex. 2.— The functions e%* are linearly independent, provided 
Gy << dg See Say 

Proof-—We assume that this statement has been proved true for 
(x — 1) such exponential functions. Then if 


cyeFs® +. cet +... + ¢,6%* = 0 
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is an identity in z, we divide by e%.” and, putting a, — a, = b,, obtain 
oe%% + cred? 4... + 6, 1 e%-1% + 6, = 0, 

If we differentiate this equation with respect to x, the constant ¢, dis- 
appears and we have an equation which implies that the (n ~ 1) functions 
ebs@, est, .., e'n-1® are linearly dependent, from which it follows that 
et, eM, ..., e%n-1” are linearly dependent, contrary to our original 
assumption. Hence there cannot be a linear relation between the » 

original functions either. 


Ex. 3.—The functions sinz, sin2z, sin3zx,..., sinnz are linearly 
independent in the interval 0 S 2 Sm. We leave the reader to prove this, 


+a 
using the fact that f sinmz sinnz da = ( rae ee * (Cf. Vol. I, p. 217.) 
—7 


If we assume that the functions ¢,(z) have continuous 
derivatives up to the (nm — 1)-th order, we have the following 
theorem: 

The necessary and sufficient condition that the system of func- 
tions $,(x) shall be linearly dependent is that the equation 


(2) 22) C2) 
W = $y (2) s'(a) +++ Gy (2) 90 
PE M(a)  PE-M@) 2. AP) 
shall be an identity in x. In addition the n determinants formed 
from (n — 1) of the functions must not vanish simultaneously at 
any point. The function W is called the Wronskian of the 
system of functions.* 
That the condition is necessary follows immediately: if we 
assume that Xe,d,(x) = 0, 


successive differentiation gives the further equations 
Leh, (x) = 0, 


Leh-Y(a) = 0. 
These, however, form a homogeneous system of n equations, 
which are satisfied by the n coefficients ¢,...,¢,; hence W, 
the determinant of the system of equations, must vanish. 
That the condition is sufficient, that is, that if W == 0 the 
functions are linearly dependent, may be proved in various ways. 


*In this proof and the following one a knowledge of the elements of the 
theory of determinants is assumed. 
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One proof is as follows. From the vanishing of W we may deduce 
that the system of equations 


Gd, +--+ nbn =0 

ad +...+trnd’ =0 

6," ) fice+ Cahn YD =0 
possesses a solution ¢, ¢y,...,¢,» Which is not trivial, where 
c, may still be a function of z. Here we may assume without loss 
of generality that c, = 1. Further, we may assume that V, the 
Wronskian of the (n — 1) functions 4, $.,..-, ¢n—» 18 not 
zero, for we may suppose that our theorem has already been 
proved for (n — 1) functions; then V = 0 implies the existence 
of a linear relation between ¢,, ¢a, . . - , dn—1, and hence between 


d1, $a $3, ---> bn BY differentiating * the first equation with 
respect to x and combining the result with the second, we obtain 


ody — Ce’ he ee Si Cnt bn-1 = 0; 


similarly, by differentiating the second equation and combining 
the result with the third, we obtain 


©y' by! + Cy be’ + oe + ona baa = 9, 
and so on, up to 
c,h, °-® + Ce "bs (n—2) eee + Cn-1' GO? = es 


Since V, the determinant of these equations, is assumed not 
to vanish, it follows that ¢,', c’,...,; Cn’ are zero; that is, 
Cy, Cg +++» Cn4 are constants. Hence the equation 


Eevb,(z) = 0 


does actually express a linear relation, as was asserted. 
We now state the fundamental theorem on linear differential 
equations: 
Every homogeneous linear differential equation 


Llu] = ay(x)u x) + a,(z)u"-"x) +... + a,u(z) = 0 
*Tt is easy to see that the coefficients c; are continuously differentiable 


functions of x; for, if the determinant V is not zero, they can be expressed 
rationally in terms of the functions ¢, and their derivatives. 
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possesses systems of n linearly independent solutions v4, Ug, .. . 5 Up 
By superposing these fundamental solutions every other solution 
u may be expressed * as a linear expression with constant coefficients 
Cy, ees Cy! % 
u = Ley. 

i=1 

In particular, a system of fundamental solutions can be 
determined by the following conditions. At a prescribed point, 
say x= &€, u, is to have the value 1 and all the derivatives of w, 
up to the (n — 1)-th order are to vanish; u;, where 7 > 1, and all 
the derivatives of u; up to the (n — 1)-th order, except the i-th, 
are to vanish, while the i-th derivative is to have the value 1. 

The existence of a system of fundamental solutions follows 
from the existence theorem proved in the next section (p. 450). 
It follows from Wronski’s condition, which we have just proved, 
that a linear relation must exist between any further solution 
wu and %,..., Uy; for from the equations 


n 
La uler—D —= 0 


l=0 
n 
Dayuir-Y = 0 C= 1, eee, n) 
1=0 
it follows that the Wronskian of the (m+ 1) functions 
U, Uy, Ug, ..., Uy Must vanish, so that u, wy, Ua, ..., Up, are 


linearly dependent. Since u,,..., u, are independent, wu depends 
linearly on wy, ...5 Un. 


2. Homogeneous Differential Equations of the Second Order. 
We shall consider differential equations of the second order 
in more detail, as they have very important applications. 
Let the differential equation be 
Llu] = au!’ -+ bu’ + cu = 0. 
Tf u(x), u(x) are a system of fundamental solutions, 
W = ute’ — uu,’ is its Wronskian, and W’ = uu,” — tu," 
Since 
L{uj=90 and Llu] = 0, 


*Two different systems of fundamental solutions u,, ..., tUnj Ujy+0+5 
can be transformed into one another by a linear transformation 


n 
Vg = Legh 
k=1 


where the coefficients c,; are constants and form a matrix whose determinant 
does not vanish. 
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it follows that 
u,L{u,|] — u-L[u,] = aW’ + bW =0. 


Hence by integration 
b 
k + log|W| = — f dz, 
or 
fede 
W = ce-JS5%, 
where c is a constant. This formula is used a great deal in the 
more detailed theory of differential equations of the second 
order. 
Another property worth mentioning is that a linear homo- 
geneous differential equation of the second order can always be 


transformed into an equation of the first order, known as 
Riccatr’s differential equation. Riccati’s equation is of the form 


ov +v4+q+r=0, 
where v is a function of #; or, in a slightly more general form, 
o + per+ qu+r=0, 


which is obtained from the first form by putting v= z/p. The 
linear equation is transformed into Riccati’s equation by putting 
uw’ = uz, so that u” = uz + uz’ = uz* -+ uz’, and we have 


az +a2+ bz +ec=0. 


A third remark: if we know one solution v(x) of our linear 
homogeneous differential equation of the second order, the prob- 
lem is reduced to that of solving a differential equation of the 
first order, and can be carried out by quadratures. In fact, if we 
assume that L{v] = 0 and put w= 2v, where 2(x) is the new 
function which we are seeking, we obtain the differential equation 


az'’y + 2az’v!’ + bz'v + 2L[v] = ave"’ + (2av’ + bv)’ = 0 


for z. This, however, is a linear homogeneous differential equation 
for the unknown function z’ = w; its solution is given on p. 429. 
From w we then obtain the factor z, and hence the solution u, 
by a further quadrature. 


Example.—The linear equation of the second order 


o_9¥ 194 0 
rag a 
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is equivalent to Riccati’s equation 


2 2 
eo ~ oss 
+2 ots 0, 


where z= 9’/y. The original equation has y = a as a particular solution; 
hence it may be reduced to the equation of the first order 


vx = 0, 
where v= y/x. That is, v= ax-+ 6. Hence the general integral of the 
original equation is given by 
y = ax? + ba. 


We would expressly emphasize that exactly the same method 
can be used to reduce a linear differential equation of the n-th 
order to one of the (n — 1)-th order, when one solution of the 
first equation is known. 


EXaMPLes 


1. Prove that if a,...,@, are different numbers and P,(z),..., 
P,(z) are arbitrary polynomials (not identically zero), then the functions 
pi(z) = P,(x)ea*, ..., 9,(2%) = P,(x)e%” are linearly independent. 

2. Show that the so-called Bernoulli’s equaiion 

y + afz)y = b(x)y” (n= 1) 
reduces to a linear differential equation for the new unknown function 
z=y", Use this to solve the equations 

(a) ay + y= y loge. 

(6) ay*(ay’ + y) = a. 

(c) (1 — 2%) — ay = aay? 

3. Show that Riccati’s differential equation 

y + P(x)y* + Qz)y + R(x) = 0 
can be transformed into a linear differential equation if we know a particular 
integral y, = y,(z). (Introduce the new unknown function u = 1/(y — y,). 
Use this to solve the equation 
y — vy +a2t—-1=0 
which possesses the particular integral y, = 2. 

4, Find the integrals which are common to the two differential equations 

(a) f= y+ 2x — xt (6) Y= —yP—yt a+ + at 

6*. Integrate the differential equation 

y =y' 4+ 2a — at 
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in terms of definite integrals, using the particular integral found in Ex. 4, 
Draw a rough graph of the integral curves of the equation throughout the 
xy-plane. 
6*. Let yy Ys Ya Y4 be four solutions of Riccati’s equation (cf. Ex. 3). 
Prove that the expression 
: ¥— Ys . Y2— ¥s 
Y— Ys Ye Ms 
is a constant. 
7. Show that if two solutions, y,(x) and y,(z), of Riccati’s equation are 
known, then the general solution is given by 


y— 4 = ey — yee JP e, 


where ¢ is an arbitrary constant. 
Hence find the general solution of 


—y tanz = y? = : 
yy aes one cos @ 
which has solutions of the form @ cos". 
8. Prove that the equations 
(a) (L— x)y” + zy’ — y= 0, 
(6) 2a(2% — jy” — (40° + ly + y(2z+ 1)=0 
have a common solution. Find it, and hence integrate both equations 
completely. 


3. The Non-homogeneous Differential Equation. Method of 
Variation of Parameters. 


To solve the non-homogeneous differential equation 
Llu] = au +... +4,u= (2) 


it is sufficient, by what we have said on p. 439, to find a single 
solution. This may be done as follows. By proper choice of 
the constants c, Ca, ..., Cp», we first determine a solution of 
the homogeneous equation L[u] = 0 in such a way that the 
equations 


u(é)= 0, w(Z)=0, ..., um %(Z)=0, um MEZ)=1 


are satisfied. This solution, which depends on the parameter @, 
we denote by u(z, £). The function u(z, €) is a continuous function 
of ¢ for fixed values of z, and so are its first n derivatives with 
respect to 2 As an example, for the differential equation 
u’ + ku =0 the solution u(x, €) has the form sink(x — €)/k, 
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and this fulfils the conditions stated above. We now assert that 
the formula 


v(a) = f $(f)u(a, &)dé 


gives a solution of Z[u] = ¢ which, together with its first n — 1 
derivatives, vanishes at the point* «= 0. To verify this state- 
ment we differentiate the function v(x) repeatedly with respect 
to x by the rule for the differentiation of an integral with respect 
to a parameter (cf. Chap. IV, section 1, p. 220), and recall the 
relations 


u(x, z)= 0, u'(a,z)=0, ..., ua, 2) = 0, ua, 2) = 1 


(where e.g. u’(x, x) = Ou(z, €)/Ox for = a). 
We thus obtain 


v(e) = gue, +f (Ou'lw, de = fd (u'(w, Oe, 
t=x 40 0 
va) = $Ou'(a, | +f d(eu"@, Hae = fH Ou" wOAE, 
g=x “0 0 
Mo) = $lEu—Ma, 2) +f HOW Me, Ode 
| =f Hen, eae, 
(a) = HeuMa, OL + ['HeUMe, Bae 


= $@) +f “(Eu (a, 2) dé. 


Since L[u(x, £)] = 0, this establishes the equation L[v] = ¢(z) 
and shows that the initial conditions (0) = 0, v’(0)=0,..., 
v0) = 0 are satisfied. 

The same solution can also be obtained by the following 


* The physical meaning of this process is this. If x = ¢ denotes the time and 
« the co-ordinate of a point moving on a straight line subject to a force ¢(z), 
the effect of this force may be thought of as arising from the superposition of 
the small effects of small impulses. The above solution u(x, ¢) then corresponds 
to an impulse of amount | at time ¢, and our solution gives the effect of im- 
pulses of amount ¢( £) during the time between 0 and z. We cannot go further 
into the details here. 
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apparently different method. We seek to find a solution u of the 
non-homogeneous equation in the form of a linear combination 
u = Dyi(a)eu(z), 

but now we must allow the coefficients y, to be functions of 2. 
On these functions we impose the following conditions: 

1 Uy + Yo'Ue + eet yn =) 

Ya'My tM! +... yn'ty’ == 0 


yay?) + Yq Ug'"—») +...t Yn Ug" = 0. 


From these it follows that the derivatives of u are given by the 
following formule: 


u 
u 


= Lyu,’ 
a” = Sy,” 
uD — Syuee-D 

uM = Dy/u—Y + Ly a. 


Substituting these expressions in the differential equation and 
remembering that L[u] = ¢, we have 


Ly,uj*—) = g(a). 


For the coefficients y,/ we obtain a linear system of equations, 
whose determinant is W, the Wronskian of the system of funda- 
mental solutions #,, and therefore does not vanish. Thus the 
coefficients y,’ are determined, and hence by quadratures the 
coefficients y,. As the whole argument can be reversed, a solution 
of the equation has actually been found, and in fact all solutions, 
in virtue of the integration constants concealed in the coefficients 
Yer 

We leave it to the reader to suow that the two methods are 
really identical, by expressing u(x, €), the solution of the homo- 
geneous equation defined above, in the form 


u(x, €) = Za, é)u,(z). 


The latter method is known as variation of parameters, 
because here the solution appears as a linear combination of 
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functions with variable coefficients, whereas in the case of the 
homogeneous equation these coefficients were constants. 


Example.—We consider the equation 
wu” — 24 4 2% = wet, 
a x 


By p. 443, a system of fundamental solutions of the corresponding 
homogeneous equation 


"919% _9 
. rans a 


is given by u, = 2, uw, = 2%. Hence if we seek solutions of the form 


Ue 2 + Yor", 
we have the conditions 
yu + y:'2" = 0, 
Ya + 2y2'x = xe® 
for y, and y,. That is, 
YY = —e", Ye’ = e%. 
Hence the genera) solution of the original non-homogeneous equation is 


u == we* + c.4 + cnn, 


4. Forced Vibrations. 


As an application we shall give a brief account of a method for 
dealing with forced vibrations, in which the right-hand side ot 
the differential equation need no longer be a periodic function, 
as in the cases considered in Vol. I, Chap. XI, section 3 (p. 510), 
but may instead be an arbitrary continuous function f(t). For 
the sake of simplicity we restrict ourselves to the case where 
there is no friction and take m= 1 (or, what amounts to the 
same thing, divide through by m). We accordingly write the 
differential equation in the form 


H(t) + walt) = $(0) 


where the quantity «? is what we previously called &, and the 
external force is denoted by ¢ instead of f. 
According to p. 446, the function 


Fo =! a "f(A) sin x(t — AJGA 
KwY@ 
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is a solution of the differential equation Z ++ «a = ¢(t), and satis- 
fies the initial conditions 


F(0)=0, F(0)=0. 


For the general solution of the differential equation we thus. 
obtain, just as before, the function 


x(t) = * [ 40) sin«(t — A)dA + ¢, sinkt + c, cosxt, 


where ¢, and ¢, are arbitrary constants of integration. 

If, in particular, the function on the right-hand side of the 
differential equation is a purely periodic function of the form 
sinwt or coswt, a simple calculation shows that we again obtain 
the results of Vol, I, Chap. XI, section 3. 


ExaMPLes 


1*. Prove that the linear homogeneous equation 
Ly) = ff 4 ey) +... + eniy’ + On = 0 
with constant coefficients ¢ has a system of fundamental solutions of the 
form 2e%k®, where the a,’s are the roots of the polynomial 
f(z) = 2? + ez t+... ty 

2. Integrate the following equations: 

(a) y”’—y=0. (0) yo” — 4y” + By’ — y= 0. 

(c) y/” — 8y” + 3y —~y=0.  (d) yo — 3y” + y=. 

(e) xty” + ay’ —-y= 0. 

3. Let 

ay + ay’ +... +4,y™ = Plz) 

be a linear non-homogeneous differential equation of the n-th order with 


constant coefficients, and let P(x) be a polynomial. Let a) + 0 and consider 
the formal identity 
1 j 

ee Hb ttt bt wwe 
a+at+...+ae °° ? f 
Prove that 

y = boP(x) + 0,P’(x) + 6.P’(x) +... 
is a particular integral of the differential equation. 

If a = 0, but a, + 0, then the expansion 


1 
——_4 = > = et bata 
at + af? +... + aye” ts be + yt + bgt + 


16 (8912) 
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is possible. Prove that now 
ye ik P(x) dar + byP(x) + b,P’(2) + byP"(a) Lo. 


is a particular integral of the differential equation. 

4, Apply the method of Ex. 3 to find particular integrals of 

(a) y+ y= 30°— be, (8) op” Fy = (1+ 2) 
5. A particular integral of the equation 
ay + ay’ +... + ayy = ek* P(x), 
where k, a, @,,... are real constants and P(x) is a polynomial, can be 
found by introducing a new unknown function z = 2(x) given by 
y = zeke, 


and applying the method of Ex. 3 to the equation in z. 
Use this method to find particular integrals of 


(a) y+ 4y + 8y = 8e%; (6) oy” — 2y’ + y = ae”. 
6. Integrate the equation 


y" — by’ + by = (2% — 3) 
completely. 


5. GenERAL REMARKS ON DIFFERENTIAL EQUATIONS 


Although a complete theory of differential equations would 
extend far beyond the compass of this book, we shall here sketch 
at least the elements of a general method for their treatment. 


lj. Differential Equations of the First Order and their Geometrical 
{nterpretation. 


We begin by considering a differential equation of the first 
order, that is, an equation in which the first derivative of the 
function y(x), but no higher derivative, occurs in addition to x 
and y(z). The general expression for a differential equation 
of this type is 

F(x, y, y') = 0, 


where we assume that the function F is a continuously differen- 
tiable function of its three arguments 2, y, y’. We now attempt 
to visualize the geometrical meaning of this equation. In the 
points of a plane region with rectangular co-ordinates (2, y), 
this equation prescribes a condition for the direction of the 
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tangent to any curve (x) which passes through this point and 
satisfies the differential equation. We assume that in a certain 
region R of a plane, say in a rectangle, the differential equation 
F(z, y, y’) = 0 can be solved uniquely for y’, and thus expressed 


in the form 
¥y =f, y), 


where the function f(a, y) is a continuously differentiable function 
of z and y. Then to each point (2, y) of R this differential equation 
y = f(z, y) assigns a “ direction of advance”. The differential 
equation is therefore represented geometrically by a field of 
directions; and the problem of solving the differential equation 
geometrically consists in the finding of those curves which belong 
to this field of directions, that is, whose tangents at every point 
have the direction pre-assigned by the equation y’ = f(z, ¥). 
We call these curves the integral curves of the differential 
equation. 

It is now intuitively plausible that through each point (2, y) 
of & there passes just one integral curve of the differential equa- 
tion y’ = f(z, y). These facts are stated more precisely in the 
following fundamental existence theorem: 

[f in the differential equation y’ = f(x, y) the function f ts con- 
tinuous and has a continuous derivative with respect to y in a region 
R, then through each point (Xo, Yo) of R there passes one, and only 
one, integral curve, that is, there exists one, and only one, solution 
y(x) of the differential equation for which y(X) = Yo. 

We shall return to the proof of this theorem in sub-section 4 
(p. 459). Here we confine ourselves to the consideration of some 
examples. | 


For the differential equation 


: y 
which we consider in the region y < 0, say, the direction of the field of 
directions is readily seen to be perpendicular to the vector from the origin 
to the point (x, y). From this we infer by geometry that the circular arcs 
about the origin must be the integral curves of the differential equation. 
This result is very easily verified analytically. Jor from the equation of 


these circles, 
y= Ve — » 
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it follows at once that 


2 
f= — Tea 


which shows that these circles satisfy the differential equation. 
At each point the field of directions of the differential equation 


y 


z 


obviously has the direction of the line joining that point to the origin. 
Thus the lines through the origin belong to this field of directions and are 
therefore integral curves. As a matter of fact, we see at once that the 
function y = cx satisfies the differential equation * for any arbitrary 
constant c. 

In the same way we can verify analytically that the differential equations 


x 
=, (y + 0) 
and 


y=-£ @#9 


are satisfied by the respective families of hyperbolas 
y= Ve + 2), 


c 


=-, 


x 


where c is the parameter specifying the particular curve of the family. 


Our fundamental theorem shows in general that differential 
equations of the first order are satisfied by a one-parameter 
family of functions, that is, by functions of z which depend not 
only on « but also on a parameter c (for example, on c= yg= y(0)); 
as we say, the solutions depend on an arbitrary constant of 
integration. The ordinary integration of a function f(x) is merely 
a special case of the solution of this differential equation, namely, 
the special case in which f(x, y) does not involve y. All the 
directions of the field of directions are then determined by the 
a-co-ordinate alone, and we see at once that the integral curves 
are obtained from one another by translation in the direction of 
the y-axis. Analytically this corresponds to the familiar fact that 
in indefinite integration, that is, in the solution of the differential 

* At the origin the field of directions is no ionger uniquely defined; this is 


connected with the fact that an infinite number of integral curves pass through 
this “singular point ” of the differential equation. 
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equation y’ = f(z), the function y involves an arbitrary additive 
constant c. 

The geometrical interpretation of the differential equation 
now enables us to carry out an approximate graphical integration, 
that is, a graphical construction of the integral curves, in much 
the same way as in the special case of the indefinite integration of 

yy 


10 11 


Fig. 10.—-Directions of the integral curves on the isoclines in fig. 12 
Fig. 11.—Solutions of y’ = V (x* + »*)/x by the isoclinal method 


a function of # (Vol. I, pp. 119-21). We have only to think of the 
integral curve as replaced by a polygon in which each side has 
the direction assigned by the field of directions for its initial point 
(or for any other one of its points). Such a polygon can be con- 
structed by starting from an arbitrary point in R. The smaller 
we take the length of the sides of the polygon, the greater the 
accuracy with which the sides of the polygon will agree with the 
field of directions of the differential equation, not only at their 
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initial points but throughout their whole length. Without going 
into the proof, we here state the fact that by successively 
diminishing the length of side a polygon constructed in 
this way may actually be made to approach closer and closer 
to the integral curve through the initial point. For this 
process f(x, y) need not be given explicitly; it need only be 
given graphically. 

Such a graphical integration is frequently carried out in 
practice by the so-called isoclinal method. The field of directions 
is represented by joining points with the same direction by 
curves (isoclines), that is, by sketching the family of curves 
f(x, y)=¢= const. To every value ¢ of this constant there 
then corresponds a definite direction which can, for example, 
be sketched in an auxiliary figure. An integral curve must then 
cut every isocline in the corresponding direction obtained from 
the auxiliary figure, and the construction of the integral curves 
is therefore easily carried out by drawing parallels. 


= vets) Here 


the isoclines are half-lines through the origin. The corresponding directions 
y’ agree with the correspondingly-numbered directions in the auxiliary 


fig. 10. 


2. The Differential Equation of a Family of Curves. Singular 
Solutions. Orthogonal Trajectories. 


Fig. 11 shows the graphical integration * of y’ 


The existence theorem shows that a family of curves corre- 
sponds to every differential equation. This suggests the question 
whether this statement is reversible. In other words, does every 
one-parameter family of curves (x, y, c)=0 or y= git, ©) 
have a corresponding differential equation . 


F(a, y, y’) = 9 


which is satisfied by all the curves of the family, and how can we 
find this differential equation? Here the essential point is that ¢, 
the parameter of the family of curves, does not occur in the 
differential equation, so that the differential equation is in a 
sense a representation of the family of curves not involving a 

* This differential equation can be integrated explicitly by introducing 


polar co-ordinates, but the result of this explicit integration is by no means 
so clear and easy to discuss. 
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parameter. In fact, it is easy to find such a differential equation. 
Differentiating the equation 


P(x, y, c) = 0 
with respect to x, we have 
bz oe dyy’ = 0. 


If ¢, is not identically zero, and if we eliminate the parameter 
¢ between this equation and the equation ¢= 0, the result is 
the desired differential equation. This elimination is always pos- 
sible for a region of the plane in which the equation ¢ = 0 can be 
solved for the parameter c in terms of and y. We then have 
only to substitute the expression c= c(z, y) thus found in the 
expressions for ¢, and ¢, in order to obtain a differential equation 
for the family of curves. 


As a first example we consider the family of concentric circles 
w+ y*— ¢c?= 0, from which, by differentiation with respect to 2, we 
obtain the differential equation 

at yy’ = 0, 


in agreement with p. 451. 

Another example is the family (# — c)? + y? = I of circles with unit 
radius and centre on the a-axis. By differentiation with respect to x we 
obtain 


(z—¢) + yy’ = 0, 
and on eliminating ¢ we obtain the differential equation 
1-YH=¥7y? or FIt+y?)=1. 
The family y = (a — c)* of parabolas touching the x-axis likewise leads 
by way of the equation y’ = 2( — c) to the required differential equation 
y? = 4y. 


In the last two examples we see that the corresponding 
differential equations are satisfied not only by the curves of 
the family, but in the first case by the lines y= 1 and y= —1 
also, in the second case by the z-axis y= 0 also. These facts, 
which can at once be verified analytically, follow without calcu- 
lation from the geometrical meaning of the differential equation. 
For these lines are the envelopes of the corresponding family of 
curves, and since the envelopes at each point touch a curve of 
the family, they must at that point have the direction prescribed 
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by the field of directions. Therefore every envelope of a family of 
integral curves must itself satisfy the differential equation. 
Solutions of the differential equation which are found by forming 
the envelope of a one-parameter family of integral curves are 
called singular solutions.* 

If to each point P of a region R which is simply covered by 
a one-parameter family of curves @(z, y) = c= const. we 
assign the direction of the tangent of the curve passing through 
P, we obtain a field of directions defined by the differential 


equation y! = — s (see above). If, on the other hand, to each 


v 
point P we assign the direction of the normal to the curve passing 
through it, the resulting field of directions is defined by the 
differential equation 


The solutions of this differential equation are called the ortho- 
gonal trajectories of the original family of curves (x, y) =. 
The curves ®=c and their orthogonal trajectories intersect 
everywhere at right angles. Hence if a family of curves is given 
by the differential equation y’ = f(z, y), we can find the differential 
equation of the orthogonal trajectories without integrating the 


* It is remarkable that we can find singular solutions of a differential equation 
F(x, y, y’) = 0 without integrating the differential equation, that is, without 
having the one-parameter family of ordinary solutions to start from. For we 
recall that by our fundamental theorem the solution of the differential equation 
is uniquely determined in the neighbourhood of a point (x, y) when in this 
neighbourhood the differential equation can be written in the form 9’ = f(x, y), 
where f(x, y) is a continuously differentiable function. It follows that at the 
points through which both a member of the family and also a singular solution 
pass, such an expression must be impossible. In the neighbourhood of this 
point (z, y) the differential equation F(x, y, y’) cannot have a solution in the 
above form. The theorem on implicit functions in Chap. III, section 1 (p. 117), 
however, states that such a solution is possible if Fy +: 0 at the place in question. 
We thus find that a necessary (but by no means a sufficient) condition for a 
point of a singular solution is that the equation 


C) 
oy’ 


is satisfied. If we eliminate y’ between this equation and the given differential 
equation, we obtain an equation between 2 and y which the singular solution 
must satisfy (if it exists). The examples above confirm this rule. Thus from 
the differential equation 4°(1 + y’*) = 1 we obtain the equation y*y’ = 0 by 
differentiating with respect to y’. From these two relations we have y* = 1, 
or y = +1, which are the singular solutions found above. 


F(z, yy’) = 0 
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given differential equation, for the equation of the orthogonal 
trajectories is 
i 1 
YS 
f(a 9) 

In the examples discussed above, from the differential equation satisfied 
by the circles V(a* + y*) = ¢ we find that the differential equation of the 
orthogonal trajectories is y’ = y/x. The orthogonal trajectories are there- 
fore straight lines through the origin (see p. 452), 

If p> 0, the family of confocal parabolas (cf. Chap. II, p. 137) 
y? — 2p(x + p/2) = 0 satisfies the differential equation 


y= ii +/+ 99). 


Hence the differential equation of the orthogonal trajectories of this 
family is 


y = Ye — le + yp 


“Cet Vet y 
The solutions of this differential equation are the parabolas 
y? — 2pla + p/2) = 0, 


where p < 0, which are parabolas confocal with one another and with the 
curves of the first family. 


3. The Integrating Factor. (Euler’s Multiplier.) 
If we write the differential equation y’ = f(z, y) in the form 
dy —f(z, y)da = 0, 


where dz and dy are the differentials of the independent and 
dependent variables respectively (for the idea of the differential 
see Chap. IT, p. 66), and multiply by an arbitrary non-vanishing 
factor O(a, y), we arrive at an equivalent differential equation of 
the form 


a(x, y)dx + b(z, y)dy = 0. 


The problem of the general solution of the differential equation 

consists in finding a function y(x) such that this differential equa- 

tion for the differentials dx and dy is satisfied identically in z. 

In one case such a solution can be given immediately; namely, 

when the expression adx-+ bdy is the total differential of a 

function F(a, y), that is, if a function F(x, y) exists for which 
16° (B912) 
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a= OF /ox and b = oF /oy. The differential equation then becomes 


ak = 0. 
This is solved if we put 
F(x, y) = ¢, 


where ¢ is an arbitrary constant of integration c, and from this 
equation we calculate y as a function of x and of the constant 
of integration ¢. 

According to Chap. V (p. 354), a necessary and sufficient 
condition that adz-+ bdy may be the total differential of a 
function F is that the condition of integrability ¢a/¢y = 0b/dx is 
satisfied. If this condition is satisfied, the line integral of the 
expression ada + bdy is independent of the path and for a fixed 
initial point P, represents a function F(x, y) of the end-point 
P with co-ordinates (x, y), and this function F gives us the 
above solution. 

In general, the coefficients a and 6 of a differential equation 
adx + bdy = 0 do not satisfy the condition of integrability. This 


is true e.g. for the differential equation dx + Y dy= 0. We can 
x 


then attempt to multiply the differential equation by a factor 
(2, y) which is chosen in such a way that after the multiplication 
the coefficients do satisfy the condition of integrability, so that 
the differential equation can be solved by evaluating a line 
integral along a particular path, that is, by a simple integration. 
In our example p(x, y) = & is such a factor. It leads to the 
differential equation «dx -++ ydy = 0, the left-hand side of which 
is the differential of the function 3(a? + y*). Thus in agreement 
with the previous result on p. 451 the solutions of the differential 
equation are the circles 2? + y? = 2c. 

In general, such a factor u(x, y), which we call an integrating 
factor or multiplier of the differential equation, is determined by 
the condition that 


jy ua) = 2 ub) or apy ~ bua + (Oy — ba) = 0. 


The still unknown integrating factor p(z, y) is therefore itself 
determined by an equation involving derivatives, and in fact 
partial derivatives with respect to 2 and y. Thus the finding of 
an integrating factor is not in theory any simpler than the 
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original problem. Nevertheless, in many cases such a factor is 
easily found by trial and error, as in the above example. The 
integrating factor, however, is chiefly of theoretical interest, 
and we shall not discuss it further here. 


4. Theorem of the Existence and Uniqueness of the Solution. 


We now prove the theorem of the existence and uniqueness 
of the solution of the differential equation y’ = f(x, y) which 
we stated on p. 451. Without loss of generality we can assume 
that for the solution y(x) in question we have y(0) = 0, for other- 
wise we could introduce y— y¥,= 7 and «—2,= € as new 
variables and should then obtain a new differential equation, 
dy/dé = f(E + x, 7+ Yo), of the same type, to which we could 
apply our argument. 

In the proof we may confine ourselves to a sufficiently small 
neighbourhood of the point «= 0. If we have proved the exis- 
tence and uniqueness of the solution for such an interval about 
the point x = 0, we can then prove the existence and uniqueness 
for a neighbourhood of one of its end-points, and so on. 

We first convince ourselves that there cannot be more than 
one solution of the differential equation satisfying the initial 
conditions. For if there were two solutions y,(x) and y,(x), for 
the difference d(x) = y, — y_ we should have 


d'(x) = f(a, yx(x)) — f(a, yo(a)). 


By the mean value theorem the right-hand side of this equation 
can be put in the form (y, — y%)f,(x, 7) = dx) f(x, 9), where 7 
is a value intermediate between y, and y,. In a neighbourhood 
| «| <a of the origin y, and y, are continuous functions of x 
which vanish at «= 0. Let 6 be an upper bound of the absolute 
values of the two functions in this neighbourhood, so that 
| ¥| Sb whenever |x| <a. Moreover, by M we shall mean a 
bound of | f,| in the region |x|<a,|y| 0b. Finally, let D 
be the greatest value of | d(x)| in the interval |z|<a. We 
suppose that this value is assumed at x= €. Then 


| d’(x) | = | d(x) f(x, 9)| S DM, 
and therefore 


D=|dé|= | fteae 


s|é€|DMsaDM 
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We can choose a so small that aM <1, for if | f,(x, y)| is 
less than M in a region | | <a, | y| SB, it continues to be less 
than M in every region obtained by reducing a. But if aM <1, 
from D=aMD it follows that D=0. ‘That is: in such an 
interval | z| <a we have * y,(x) = y,(z). 

By a similar integral estimate we arrive at a proof of the 
existence of the solution. We construct the solution by a method 
which is also important in applications, in particular, in the 
numerical solution of differential equations. This is the process 
of iteration or successive approximation. Here we obtain the 
solution as the limit function of a sequence of approximate 
solutions y9(z), y,(x), yo(z),... . Asa first approximation y,(z). 
we take y,(x) = 0. Using the differential equation, we take 


v3(2) =f "f(g, 0) dé 


as the second approximation: from this we obtain the next 
approximation y,(%), 


vate) = f TE ylé))aé, 


and in general the (n + 1)-th approximation is obtained from the 
w-th by the equation 


Yn(tt) = [3 4(& Yna(é))de. 


If in an interval | z| < a these approximating functions converge 
uniformly to a limit function y(x), we can at once perform the 
passage to the limit under the integral sign, and for the limit 
function we obtain the equation 


ua) =f f(é weyadé, 


from which it follows by differentiation that y’ = f(x, y), so that 
y is actually the required solution. 

We carry out the proof of convergence for a sufficiently small 
interval | «| <a by means of the following estimate. We put 
Ynt1(2) — Yn(x) = d,(z) and by D, denote the maximum of 
| d,,(z) | in the interval |x| < a. 

* The root idea of this proof is the fact that for bounded integrands inte- 


gration gives a quantity which vanishes to the same order as the interval of 
integration, as that interval tends to zero. 
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From the equation 
d;,(x) = Yntr = Yn = f(x, Yn) — f(a, Yn—1) 
the mean value theorem gives 
a,,(2) = dy—y(2)f, (2, Gn-1(2)), 
where ,,-; 18 @ value intermediate between y, and y,-,. Let the 
inequalities | f,(2, y)| <M, |f(c, y)|<M, hold in tho rect- 
angular region |x| <a, |y|<b. If we assume that for the 


function y, the relation | y, | Sb holds in the interval | «| Sa, 
then by the definition of ¥,4, we have 


[FG ylende 


We shall therefore choose the bound a for x so small that 
aM, <b. Then in the interval | «| < a we shall certainly have 
| Yntr(z) | Sb. Since for y(x) = 0 it is obvious that | y)| 3, 
in the interval | «| <a we have | y,(x)| 6 for every n. Hence 
in the equation 


| Ynta() | = <|a|M,<oM,. 


dyy(t) = is FE Gal €)) dnl £) 4 


we may estimate the integral on the right by using | f,|< @, 
and for the maximum D,,, of | d,4,(x) | in the interval | «| Sa 
we thus at once obtain 


Dri Ss aMD,. 


We now take a so small that aM < q <1, where q is a fixed 
proper fraction, say g= 3. Then Di, Sq), Sq" Do. 
Let us now consider the series 


do(x) + d(x) + da(x) +... + dase) +... 


The n-th partial sum of this series is y,(x). The absolute value 
of the n-th term is not greater than the number D,g"-! when 
|| <a. Our series is therefore dominated by a convergent 
geometric series with constant terms. Hence (cf. Vol. I, p. 392) 
it converges uniformly in the interval | «| < a@ to a limit function 
y(z), and thus we see that an interval | «| <a exists in which 
the differential equation has a unique solution. 

All that now remains to be shown is that this solution can 


462 DIFFERENTIAL EQUATIONS (Cap. 


be extended step by step until it reaches the boundary of the 
(closed bounded) region R in which we assume f(x, y) to be 
defined. The proof so far shows that if the solution has been 
extended to a certain point, it can be continued onward over 
an «-interval of length a, where a, however, depends on the 
co-ordinates (x, y) of the end-point of the portion already con- 
structed. It might be imagined that this advance a diminishes 
from step to step so rapidly that the solution cannot be ex- 
tended by more than a small amount, no matter how many steps 
are made. This, as we shall show, is not the case. 

Suppose that R’ is a closed bounded region entirely within 
R. Then we can find a 6 so small that for every point (x, ¥) 
in R’ the whole square x» -b6 Sa@Say+ b,yy—bSySyyt+b 
lies in R. If by M and M, we denote the upper bounds of | f(z, y) | 
and | f(z, y) | in the region R, then we find that in the preceding 
proof all the conditions imposed on a are certainly satisfied if 
we take a to be, say, the smallest of the numbers 0, 1/2M, and 
b/M,. This no longer depends on (2%, Yo); hence at each step 
we can advance by an amount @ which is a constant. Thus we 
can proceed step by step until we reach the boundary of R’. 
Since R’ can be chosen as any closed region in R, we see that the 
solution can be extended to the boundary of R. 


5. Systems of Differential Equations and Differential Equations 
of Higher Order. 


Many of the above arguments extend to systems of differential 
equations of the first order with as many unknown functions of 
g as there are equations. As an example of sufficient generality 
we shall here consider a system of two differential equations for 
two functions y(x) and 2(2), 


y = f(a, y, 2), 
2 = g(a, ¥, 2). 


We again assume that the functions f and g are continuously 
differentiable. This system of differential equations can be 
interpreted by a field of directions in xyz-space. To the point 
(x, y, 2) of space a direction is assigned whose direction cosines 
are in the ratio dv: dy: dz = 1:f:g. The problem of integrating 
the differential equation again consists, geometrically speaking, 
in finding curves in space which belong to this field of directions. 
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As in the case of a single differential equation, we again have 
the fundamental theorem that through every point of a region 
R in which the above functions are continuously differentiable 
there passes one, and only one, integral curve of the system of 
differential equations. The region R is covered by a two-para- 
meter family of curves in space. These give the solutions of the 
system of differential equations as two functions y(x) and z(x) 
which both depend on the independent variable z and also on 
two arbitrary parameters c, and ¢,, the constants of integration. 

Systems of differential equations of the first order are par- 
ticularly important in that equations of higher order, that is, 
differential equations in which derivatives higher than the first 
occur, can always be reduced to such systems. 


For example, the differential equation of the second order 
y= h(a, yy’) 


can be written as a system of two differential equations of the first order. 
We have only to take the first derivative of y with respect to x as a new 
unknown function z and then write down the system of differential equations 
y’ = 2, 
2 = Ka, y, 2). 
This is exactly equivalent to the given differential equation of the second 
order, in the sense that every solution of the one problem is at the same 
time a solution of the other. 
The reader may use this as a starting-point for the discussion of the 
linear differential equation of the second order, and thus prove the funda- 
mental existence theorem for linear differential equations. 


Here we cannot enter into further discussion of these questions, 
and for illustrations of these general remarks we shall merely 
refer to the differential equations of the second order which we 
have dealt with above (cf. pp. 442, 448). 


6. Integration by the Method of Undetermined Coefficients. 


Tn conclusion, we mention yet another general device which 
can frequently be applied to the integration of differential equa- 
tions. This is the method of integration by power series. We 
assume that in the differential equation 


y = f(z, y) 


the function f(x, y) can be expanded as a power series in the 


464 DIFFERENTIAL EQUATIONS {Cuap. 


variables x and y and accordingly possesses derivatives of any 
order with respect to and y. We can then attempt to find the 
solutions of the differential equation in the form of a power series 


y= Cot out ezte+... 


and to determine the coefficients of this power series by means 
of the differential equation.* To do this we may e.g. proceed by 
forming the differentiated series 


y! = C, + 2cgu + Bega? +..., 


replacing y in the power series for f(x, y) by its expression as 
a power series, and then equating the coefficients of each power 
of # on the right and on the left (method of undetermined co- 
efficients). Then if cy is given any arbitrary value, we can 
attempt to determine the coefficients 


C1) Ca, Cys Dgs oo 
successively. 

The following process, however, is often simpler and more 
elegant. We assume that we are seeking to find that solution 
of the differential equation for which y(0) = 0, that is, for which 
the integral curve passes through the origin. Then cy = c= 0. 
If we recall that by Taylor’s theorem the coefficients of the 
power series are given by the expressions 


1 
¢, = 7) y(0), 
vw 


we can calculate them easily. In the first place, c, = y’(0)=/(0, 0). 
To obtain the second coefficient c, we differentiate both sides of 
the differential equation with respect to « and obtain 


yY@H=fithy- 


If we here substitute 2 = 0 and the already known values y(0) = 0 
and y'(0)=/(0, 0), we obtain the value y’’(0) = 2c,. In the same 
way we can continue the process and determine the other co- 
efficients ¢,, ¢,,..., one after the other. 

It can be shown that this process always gives a galvioni if 


* The first few terms of the series then form a polynomial of approximation 
to the solution. To a certain extent, therefore, the method is the analytical 
counterpart of the approximate graphical integration mentioned on p. 453. 
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the power series for f(x, y) converges absolutely in the interior 
of a circle about «= 0, y= 0. We shall not give the proof here. 


EXaMPLes 


1. Verify that the left-hand sides of the following differential equations 
are total differentials, and integrate the equations: 


(a) (83% + 6ay*)da +- (6a%y + 4y*)dy = 0. 
(6) _ada+ydy | ydu—ady 9 
Vi-aty +e - 


2. Show how to solve the equation Mdx +- Ndy = 0, where M and N 
are homogeneous functions of the same degree. 


3. Integrate the equation 
(zy? — y*)da + (1 — ay*)dy = 0, 

which has an integrating factor independent of x. 

4, Integrate the equation 

2y° dx + (3xy? — 1)dy = 0, 

and from its general integral state an integrating factor. 

5. Let 

f(z, y, 0) = 90 
be a family of plane curves. By eliminating the constant ¢ between this 
and the equation bibs 
bx + dy y= 0, 
we get the differential equation 
F(x, yy’) = 0 
of the family of curves (cf. p. 455). Now let 9(p) be a given function of p; 
a curve C satisfying the differential equation 
F(x, y, o(y’)) = 9 
is called a trajectory of the family of curves f(z, y, ©) = 0. The second 
and third equations show that 
y’ = oY’) 

is the relation between the slope Y’ of C at any given point, and the slope 


y’ of the curve f(z, y, c) = 0 passing through this point. The most impor- 
tant case is 9(p) = —1/p, leading to the equation 


F mw — 2) = 0, 
y 
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which is the differential equation of the orthogonal trajectories of the family 
of curves (cf. p. 456). 

Use this method to find the orthogonal trajectories of the following 
families of curves: 


(a) a®-} y2+ cy~1=0. (b) y= cat, 
we y? 

0) Gee Bye 
(dq) y=cosx+e, (e:) (e—ePt + yr =at 

In each case draw the graphs of the two orthogonal families of curves. 


6. For the family of lines y = cx find the two families of trajectories 
in which (a) the slope of the trajectory is twice as large as the slope of the 
line; (6) the slope of the trajectory is equal and of opposite sign to the 
slope of the line. 


7. Differential equations of the type 


1 (a>b>0, —BP<c< o). 


y=ap+ Vp), p=y’ 


were first investigated by Clairaut. Differentiating, we get 
4d 
[z+ Yee =0, 
ae 


which gives p = ¢ = const., so that 
y= xe + (c) 


is the general integral of the differential equation; it represents a family 
of straight lines. Another solution is 


z= —'(p), 
which together with 


= —pb'(p) + Up) 


gives a parametric representation of the so-called singular integral. Note 
that the curve given by the last two equations is the envelope of the family 
of lines. 

Use this method to find the singular solutions of the equations 


elapse’, 
6) eee 
(0) y= xp -+ e, 
8. Find the differential equation of the tangents to the catenary 


x 
y = acosh~-, 
a 


9. Lagrange investigated the most general differential equation which 
is linear in both a and y, namely, 
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y = 29(p) + W(P)- 


Differentiating, we get 
p= op) + (zor) + VON, 


which is equivalent to the linear differential equation 


’ 7 
dz, oP) 44 Vl) 9, 
dp op)—p 9(p)—P 

provided 9(p) — p + 0 and p is not constant. Integrating and using the 
first equation, we get a parametric representation of the general integral. 
From the second equation we see that the equations 9(p)— p= 0, 
p= const. lead to a certain number of singular solutions representing 
straight lines. 

The solutions can be interpreted geometrically as follows. Consider 
the Clairaut equation 


y= xp + Yo-(p)) 


where @-1(p) is the inverse function of (p), i.e. e—(9(p)) = p. From this we 
see that the solutions of the differential equation are a family of trajec- 
tories of the family of straight lines 


y= ve + Ue Xe)]}, 


y = x9(c) + Yc) (¢ = const). 


or 


Thus e.g. 
y= —2+ Wn) 
P 


is the differential equation of the involutes (orthogonal trajectories of the 
tangents) of the curve which represents the singular integral of the Clairaut 
equation 


1 
nt 3 
y= xp o 
Use this method to integrate the equation 
y= 2p +a)—Up+a). 


10. Express, when possible, the integrals of the following differential 
equations by elementary functions: 


@) (#)'=1-¥; @ (%)'- 4 
@ (%)'- a=, @ @&)- —- 


In each case draw a graph of the family of integral curves, and detect the 
singular solutions, if any, from the figures. 
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ll. A differential equation of the form 
f y, y, »¥')=0 


(note that 2 does not occur explicitly) may be reduced to an equation of 
the first order as follows. Choose y as the independent variable and p = y’ 
as the unknown function. Then 


tp _ Up dy 
y =P, Ye Ga 


and the differential equation becomes f(y, p, pp’) = 0. 

Use this method to solve the following problem: 

At a variable point UM of a plane curve I draw the normal toT’; mark 
on this normal the point N where the normal meets the z-axis and OC, the 
centre of curvature of at M. Find the curves such that 


MN . MC = const. = k. 


Discuss the various possible cases for k > 0 and & <0, and draw the 
graphs. 

12*, Find the differential equation of the third order satisfied by all 
circles 


w+ y®+ 2axr+ Wy+c=—0. 
13. Integrate the homogeneous equation 


(7 1)'= (¢-#) (want) 


and find the singular solutions, 
14*. Solve the differential equation 


y+ ly t+y=0, 


with (0) = 1, y‘(0) = 0, by means of a power series. Prove that this 
function is identical with the Bessel function Jo(z) defined in Ex. 4, p. 223. 


6. Tae PoTentTIaL or ATTRACTING CHARGES 


Differential equations for functions of a single independent 
variable, such as we have discussed above, are usually called 
ordinary differential equations, to indicate that they involve 
only the “ordinary” derivatives of functions of one in- 
dependent variable. In many branches of analysis and its 
applications, however, an important part is played by partial 
differential equations for functions of several variables, that is, 
equations between the variables and the partial derivatives of 
the unknown function. Here we shall touch upon some typical 
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cases of partial differential equations, and shall begin by con- 
sidering the theory of attractions. 

We have already considered the fields of force produced by 
masses according to Newton’s law of attraction, and we have 
represented them as the gradient of a potential ® (cf. Chap. IV, 
p. 283 et seq.). In this section we shall study the potential in 
somewhat greater detail.* 


l. Potentials of Mass Distributions. 


As an extension of the cases considered previously we now 
take 2 as a positive or negative mass or charge. Negative 
masses do not enter into the ordinary Newtonian law of attraction, 
but they do occur in the theory of electricity, where mass is re- 
placed by electric charge and we distinguish between positive 
and negative electricity; Coulomb’s law of attracting charges 
has the same form as the law of attraction of mechanical masses. 
If a charge pw is concentrated at a single point of space with 
co-ordinates (£, 7, ¢), we call the expression p/r, where 


r= V{(e— 6? + y— a)? + (&— OF}, 
the potential + of this mass at the point (a, y, 2). By adding up 
a number of such potentials for different “sources ” or “ poles ” 
(€ 7 &) we obtain as before (cf. p. 283) the potential of a 
system of particles 
o=5h, 
iv; 
The corresponding fields of force are given by the expression 
J= y grad ®, where y is a constant independent of the masses 
and of their positions. 
If the masses, instead of being concentrated at single points or 
“ sources ”’, are distributed with density px(é, 7, ¢) over a definite 
portion R of £y{f-space, we have already taken the potential of 
this mass-distribution to be 


® =f ff P dganae. 


* An extensive literature is devoted to this important branch of analysis; 
see, e.g., Kellogg’s Foundations of Potential Theory (Springer, Berlin, 1929). 

t We could call this a potential of the mass. Any function obtained by adding 
an arbitrary constant to this could equally well be called a potential of the mass, 
since it would give the same field of force. 
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If the masses are distributed over a surface S with surface- 
density y, then the surface integral 


jp 


taken over the surface S with surfacc element do represents the 
potential of this surface, if the surface is given parametrically 
(p. 159 et seg.) by u, v as parameters. 

For the potential of a mass distributed along a curve we 
likewise obtain an expression of the form 


{= ds, 


where s is the length of arc on this curve, (s) the linear density 
of the mass, and r the distance of the point (x, y, z) from the 
point S of the curve. 

For every such potential the surfaces 


® = const. 
represent the equipotential surfaces or level surfaces.* 


As an example of the potential of a line-distribution we take this 
case: a mass of constant linear density u is distributed along the segment 
—lS32S+1 of the z-axis. We consider a point P with co-ordinates 
(x, y) in the plane z= 0; if for brevity we introduce p = +/(z* +- y?), 
the distance of the point P from the origin, we obtain the potential in the 
form 


* +1 dz 
D(z, y= ef J? + 23) +¢. 


Here we have added a constant C to the integral, which does not affect 
the field of force derived from the potential. The indefinite integral on the 
right can be evaluated as in Vol. I, p. 213, and we obtain 


dz _. 8B 2+ a/ (4 + 2) 
a iS inh — = | 
Sve LA pee ° , 


* Curves which at every point have the direction of the force vector are 
called lines of force. The lines of force are therefore curves which everywhere 
intersect the level surfaces at right angles. We thus see that the families of 
lines of force corresponding to potentials generated by a single pole or by a 
finite number of poles run out from these poles as if from a source. In the 
case of a single pole, for example, the lines of force are simply the straight lines 
passing through the pole. 
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so that the potential in the zy-plane is given by 


2 2 
D(z, y) = 2u log avis +0. 


To obtain the potential of a line extending to infinity in both directions 
we give the value —2u. log 2l to the constant * C and thus obtain 


1 RB 2 
O(2z, y) = 2u log ve!) — 2u log p. 


If we now let the length 7 increase without limit, that is, if we let the 
length of the line tend to infinity, the expression {J+ +/(J?-+ o*)}/2I tends 
to unity, and for the limiting value of ®(x, y) we obtain the expression 


D(x, y) = —2y log p. 
We thus see that apart from the factor —2u the expression 

log p = log 4/(2* + y?) 
is the potential of a straight line perpendicular to the xy-plane over which 
a mass is distributed uniformly. 


In addition to the distributions previously considered, 
potential theory also deals with so-called double layers, which 
we obtain in the following way. We suppose that at the point 
(é, 1, £) a charge M is concentrated and at the point (€ ++ h, n, £) 
a charge —M is concentrated. The potential of this pair of 
charges is given by 

M 

Vie— &?+ yah + @— OP 
_ M 

V(e— &—hP + (y— uP + &— OF 
If we let h, the distance between the two poles, tend to zero 
and at the same time let the charge M increase without limit 
in such a way that M is always equal to —p/h, where p is a 
constant, ® in the limit tends to the expression 


rae) 


We call this expression the potential of a dipole or doublet with 


* We make this choice in order that in the passage to the limit /-> » the 
potential ® shall remain finite. 
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its axis in the ¢-direction and with “moment” y. Physically it 
represents the potential of a pair of equal and opposite charges 
lying very close to one another. In the same way we can express 
the potential of a dipole in the form 


ale) 


where 0/dv denotes differentiation in an arbitrary direction », 
that of the axis of the dipole. 

If we imagine dipoles distributed over a surface S with 
moment-density », and if we assume that at each point the 
axis of the dipole is normal to the surface, we obtain an expression 


of the form 
ofl 
J ful O2()ae, 


where 0/dv denotes differentiation in the direction of the positive 
normal to the surface (we can, as before, choose either direction 
of the normal as positive), r is the distance of the point of the 
surface (€, 7, ¢) from the point (a, y, z), and the point (¢, 7, £) 
ranges over the surface. This potential of a double layer can be 
thought of as arising in the following way. On each side of the 
surface and at a distance h we construct surfaces, and we give 
one of these surfaces a surface-density ./2h, the other a surface- 
density —p/2h. At an external point these two layers together 
create a potential which tends to the expression above as h > 0. 
We shall assume that in all our expressions the point (a, y, 2) 
considered is at a point in space at which no charge is present, 
so that the integrands and their derivatives with respect to 
L, y, 2 are continuous. 


2. The Differential Equation of the Potential. 


In virtue of these hypotheses we can obtain a relation which 
all our potential expressions satisfy, namely, the differential 
equation 

®,,+ o,, + ®,,= 0, 
or in abbreviated form 
A® = 0, 


which is known as Laplace’s equation. As we have already 
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(Vol. I, p. 470) verified by simple calculation, this equation 
is satisfied by the expression 1/r. It therefore holds also for all 
the other expressions formed from it by summation or integration, 
since we can perform the differentiations with respect to z, y, 2 
under the integral sign. This differential equation is also satis- 
fied by the potential of a double layer, for in virtue of the re- 
versibility of the order of differentiation * we find that for the 
potential of a single dipole the equation 


a2 t)=Zat=o 


holds. 

Laplace’s equation is also satisfied by the expression 
log +/(x? +- y®) obtained for the potential of a vertical line, as 
we can readily verify (cf. also Chap. II, p. 76). Since this no 
longer depends on the variable z, it in fact satisfies the simpler 
Laplace’s equation in two dimensions, 


Dyg + Dyy = 0. 


The study of these and related partial differential equations forms 
one of the most important branches of analysis. We may, how- 
ever, point out that potential theory is not by any means chiefly 
directed to the search for general solutions of the equation AD = 0, 
but rather to the question of the existence and to the investigation 
of those solutions which satisfy pre-assigned conditions. Thus 
a central problem of the theory is the “ boundary-value 
problem ”, in which we have to find a solution ® of AD=0 
which together with its derivatives up to the second order is 
continuous in a region R, and which has pre-assigned continuous 
values on the boundary of R. 


3. Uniform Double Layers. 


We cannot enter here into a more detailed study of potential 
functions, that is, of functions which satisfy Laplace’s equation 
Au = 0. In this subject Gauss’s theorem and Green’s theorem 

* It must be noted that the differentiation 0/@v refers to the variables (&, y, ¢) 
and the expression A to the variables (x, y, z). Moreover, the function 1/r, con- 


sidered as a function of the six variables (x, y, 2; 7, ¢), is symmetrical in the 
two sets of variables, and therefore satisfies the differential equation 


A® = Deg + Oy, + OE = O 
with respect to the variables (£, , ¢) also. 
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(Chap. V, pp. 388, 390) are among the chief tools employed. It 
will be sufficient to show by some examples how such investi- 
gations are carried out. 

We shall first consider the potential of a double layer with 
constant moment-density » = 1, that is, an integral of the form 


v=f [2 (ae. 


This integral has a simple geometrical meaning. Let us assume 
that each point of the surface carrying the double layer can 
be seen from the point P with co-ordinates (x, y, z), that is, 
that it can be joined to this point P by a straight line which 
meets the surface nowhere else. The surface S, together with 
the rays joining its boundary to the point P, forms a conical 
region A of space. We now state that the potential of the uniform 
double layer, except perhaps for sign, is equal to the solid angle 
which the boundary of the surface 8 subtends at the point P. By this 
solid angle we mean the area of that portion of the spherical 
suriace of unit radius about the point P as centre which is cut 
out of the spherical surface by the rays going from P to the 
boundary of S. We give this solid angle the positive sign when 
the rays pass through the surtace S in the same direction as the 
positive normal v, otherwise we give it the negative sign (cf. 
Ex. 9, p. 408). 

To prove this we recall that the function «= 1/r, when 
considered not only as a function of (2, y, z) but also as a function 
of (£, 7, 2), still satisfies the differential equation 


Au = Uge + Ug + Ug, = V. 


We fix the point P with co-ordinates (x, y, z), and denote the 
rectangular co-ordinates in the conical region R by (€, », £), and 
by a small] sphere of radius p about the point P we cut off the 
vertex from R; the residual region we call R,. To the function 
u = 1/r, considered as a function of (€, 7, ¢) in the region R,, 
we now apply Green’s theorem (Chap. V, p. 390) in the form 


Jf [ Auaganac =f f Fao, 


where S’ is the boundary surface of R, and @/dn denotes differen- 
tiation in the direction of the outward normal. Since Au = 0, 
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the value of the left-hand side is zero.* If we have chosen the 
positive normal direction v on S so as to coincide with the outward 
normal n, the surface integral on the right-hand side consists of 
three parts: (1) the surface integral 


2 Qe=[2C)~ 


over the surface S, which is the expression V considered above 
(p. 474); (2) an integral over the lateral surface formed by the 
linear rays; (3) an integral over a portion T', of the surface 
of the small sphere of radius p. The second part is zero, since 
there the normal direction n is perpendicular to the radius, and 
therefore is tangential to the sphere r= const. For the inner 
sphere with radius p the symbol 0/dn is equivalent to —0/Op, 
since the outward direction of the normal points in the direction 
of diminishing values of r. We thus obtain the equation 


rT e-* 


or 


where on the right we have to integrate over the portion I’, 
of the small spherical surface which belongs to the boundary of 
R,. If we now write the surface element on the sphere with 
radius p in the form do = p#dw, where dw is the surface element 


on the unit sphere, we at once obtain 


V=—f [do 


The integral on the right is to be taken over the portion of the 
spherical surface of unit radius lying in the cone of rays, and 
we see at once that the right-hand side has the geometrical 
meaning stated above; it is the apparent magnitude, except 
for sign, if the normal direction on S is chosen so that it points 


* From this form of Green’s theorem it follows in general that the surface 
integral f i: ot do taken over a closed surface must always vanish when the 


function u satisfies Laplace’s equation Au = 0 everywhere in the interior of 
the surface. 
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outwards * from the conical region R. Otherwise the positive 
sign is to be taken. 

If the surface S is not in the simple position relative to 
P described above, but instead is intersected several times by 
some of the rays through P, we have only to divide the surface 
into a number of portions of the simpler kind in order to 
see that the statement still holds good. The potential of 
the uniform double layer (of moment 1) on a bounded surface 
is therefore, except perhaps for sign, equal to the “apparent” 
magnitude which the boundary has when looked at from the point 
(x, y, 2). 

For a closed surface we see by subdividing it into two bounded 
portions that our expression is equal to zero if the point P is 
outside, and equal to —4rr if it is inside. 

A similar argument shows in the case of two independent 
variables that the integral 


f a (log) ds 
o Ov 


along the curve C, except possibly for sign, is equal to the angle 
which this curve subtends at the point P with the co-ordinates 
(2, y). 

This result, like the corresponding result in space, can also 
be explained geometrically as follows. Let the point Q with the 
co-ordinates (, 7) lie on the curve C. Then the derivative of 
logr at the point Q in the direction of the normal to the curve 
is given by the equation 


() 7] 1 
5 (logr) = 5 (logr) cos(v, 1) = 3 cos(v, 1), 


where the symbol (v, r) denotes the angle between this normal 
and the direction of the radius vector r. On the other hand, 
when written in polar co-ordinates (r, 6) the element of arc ds 
of the curve has the form 

rd0 


cos(v, 7) 


* The negative sign is explained by the fact that with this choice of the 
normal direction the negative charge lies “ next” the point P. 
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(cf. Vol. I, pp. 266 and 280), so that the integral is transformed 
as follows: 
7) 1 rd0 
— (logr)ds = | — a0, 
Siz (logr) ic poate) cos(v, 7) ‘| 
The integral on the right, however, is the analytical expression 
of our statement. 


4. The Theorem of Mean Value. 


As a second application of Green’s transformation we 
prove the following theorem: every potential function, that 
is, every function « which in a certain region R satisfies 
the differential equation Au = 0, has the following mean value 
property: 

The value of the potential function at the centre P of an arbitrary 
sphere of radius x lying completely in the region R is equal to the 
mean value of the function u on the surface 8, of the sphere; that 18, 


u(x, y, 2) = ral [4 


where u(x, y, 2) is the value at the centre P and the value on the 
surface S, of the sphere of radius r. 

To prove this we proceed as follows: let 8, be a concentric 
sphere inside S, with radius 0< p S7. Since Au = 0 every- 
where in the interior of S,, by the footnote on p. 475 we have 


0 
[f Mae=o, 


where 0u/én is the derivative of u in the direction of the outward 
normal to S,. If (€, », ¢) are current co-ordinates and if with 
the point (a, y, 2) a8 pole we introduce polar co-ordinates by the 
equations 


é—a2= pcoospsind, n—y= psingsindé, [—z= pcos6, 
the above equation becomes 


du(p, 8, 
f [eo do = 0. 


Since the surface element do of the sphere S, is equal to p*do, 
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where dé is the element of surface of the sphere S of unit radius 
(cf. Chap. IV, p. 274), we find that if p> 0 


Ou 5. 
Lf5,2=% 


where the region of integration no longer depends on p. Con- 


sequently 
r r) 7 
f dp ag f sO 0, 


and on interchanging the order of integration and performing 
the integration with respect to p we have 


f if {u(r, 8, 6) — u(0, 0, p)}do = 0. 
Since u(0, 6, ¢) = u(x, y, 2) is independent of 6 and ¢, 
f fu, 0, ¢)do = ula, y, 2) f fae = 47u(a, y, 2). 


J fu 6, ¢)do = aS fue. 6, ¢)do, 


where the integral on the right is to be taken over the surface of 
S,, the mean value property of u is proved. 

In exactly the same way, for functions u of two variables 
which satisfy Laplace’s equation Uz, + Uyy—= 0 we have the 
corresponding mean value property of the circle expressed by 
the formula 


As 


2rr u(x, y) = i uds, 


where % denotes the value of the potential function on a circle S, 
with radius r about the point (zx, y) and ds is the element of arc of 
this circle. 


5. Boundary-value Problem for the Circle. Poisson’s Integral. 


As an example of a boundary-value problem we shall now 
discuss Laplace’s equation in two independent variables 2, y 
for the case of a circular boundary. Within the circular region 
a+ y? < R? we introduce polar co-ordinates (r, 0). We wish 
to find a function u(x, y) which is continuous within the circle 
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and on the boundary, possesses continuous derivatives of the 
first and second order within the region, satisfies Laplace’s 
equation Aw = 0, and has prescribed values u(R, 6) = f(0) on 
the boundary. Here we assume that f(@) is a continuous periodic 
function of @ with sectionally continuous first derivatives. 

The solution of this problem, in terms of polar co-ordinates, 
is given by the so-called Poisson’s integral 


Oe sai ee f(2) 
0 OR | aoe 


To prove this, we begin by constructing as many solutions 
of Laplace’s equations as we please in the following way. We 
transform Laplace’s equation to polar co-ordinates, obtaining 


Au = > (ru), + 1 tig = 0, 
r 7 


and seek to find solutions which can be expressed in the form 
u= $(r)¥(6), that is, as a product of a function of 7 and a func- 
tion of 6. If we substitute this expression for u in Laplace’s 
equation, the equation becomes 


de #8) 
Hr) 70) 


‘As the left-hand side does not involve @ and the right-hand side 
does not involve r, the two sides must each be independent of 
both variables, that is, must be equal to the same constant k. 
For y5(0) we accordingly have the differential equation + ky = 0. 

Since the function u and hence also #(8) must be periodic with 
period 27, it follows that the constant k is equal to n?, where 
is an integer. Hence 


(8) = a cosn6 + bsinné, 


where a and b are arbitrary constants. 
The differential equation for ¢(r), 


g(r) + ro'(r) — n?d(r) = 9, 


is a linear differential equation and, as we can immediately verify, 
the functions r* and r—” are independent solutions. Since the 
second solution becomes infinite at the origin, while u is to be 
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continuous there, we are left with the first solution ¢=r", and 
the solutions of Laplace’s equation are 


r"(a cosné + 6 sinné). 


We now use the fact that by linear combination of such 
solutions according to the principle of superposition (cf. section 4, 
p. 438) we can obtain other solutions 


3% + Lr"(a, cosnd + b,, sinné). 


Even an infinite series of this form will be a solution, provided 
that the series converges uniformly and can be differentiated 
term by term twice in the interior of the circle. 

If we now imagine the prescribed boundary function f(60) 
expanded in a Fourier series 


F(8) = 54% +2 (dn cosné + b, sinné), 


this series, regarded as a series in 0, certainly converges absolutely 
and uniformly (cf. Vol. I, Chap. IX, p. 451). Hence the series 


u(r, 6) = Sa, + z - (a, cosn8 +- b, sinn@) 


a fortiort converges uniformly and absolutely in the interior of 
the circle. This series, however, can be differentiated term by 
term, provided r < R#, because the resulting series again converge 
uniformly (cf. the account of power series in Vol. I, Chap. VI, 
p. 399). This function is accordingly a potential function; it has 
the prescribed value on the boundary, and hence is a solution of 
our boundary-value problem. 

We can reduce this solution to the integral form given above 
by introducing the integrals for the Fourier coefficients, 


ar ar 
0, = 1 i f(a) cosnada, 6,= Z f f(a) sinnada. 
ao a Jo 


Since the convergence is uniform, we can interchange integration 
and summation, and obtain 


u(r, 0) = [res +25 cosn(9 — a)} da 
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Poisson’s integral formula is therefore proved, provided that 
we can ee the relation 


1 RR? — 7? 
2 5+ 55 Re 7 cos = "9 R2 — QRr cost + r* 


But this can be proved by the method used in Vol. I, Chap. IX, 
p. 436; we leave the proof to the reader. 


EXAMPLES 

1. By applying inversion to Poisson’s formula, find a potential function 
u(z, y) which is bounded in the region outside the unit circle and assumes 
given values f(0) on its boundary (the so-called outer boundary-value prob- 
lem). 

2*, Find (a) the equipotential surfaces and (6) the lines of force for the 
potential of the segment z= y= 0, —? Sz S +1, of constant linear den- 
sity p. 

3*. Prove that if the values of a harmonic u(x, y, z) and of its normal 
derivative du/an are given on a closed surface S, then the value of wu at any 
interior point is given by the expression 


u(x, y, 2) = z ~ f f (2 Ou — ub) as, 


where r is the distance from the point (x, y, z) to the variable point of 
integration. (Apply Green’s theorem to the functions u and 1/r.) 


7. FurtHER EXaMpPLes OF ParriaL DIFFERENTIAL EQUATIONS 


We shall now briefly discuss a few partial differential equa- 
tions which are of frequent occurrence. 


l. The Wave Equation in One Dimension. . 


The phenomena of wave propagation, e.g. of light or sound, 
are governed by the so-called wave equation. We begin by con- 
sidering the simple idealized case of a so-called ‘“‘ one-dimensional 
wave ”. Such a wave depends on some property u, for example, 
the pressure, the change of position of a particle, or the intensity 
of an electric field; and w depends not only on the co-ordinate 
of position x (we take the direction of propagation as the z-axis) 
but also on the time é. 

The function u(z, ¢) then satisfies a partial differential equation 
of the form 

Cee = a Ute 


Ww (5912) 
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where @ is a constant depending on the physical nature of the 
medium. We can express solutions of this equation in the form 


u = f(e— at), 


where f(é) is an arbitrary function of ¢, about which we 
assume only that it has continuous derivatives of the first and 
second order. If we put ¢== 2 — at, we see at once that our 
differential equation is actually satisfied, for 


Une = f'"(§); ty = a*f”( é). 


In the same way, using another arbitrary function g(£), we 
obtain a solution of the form 


u = g(x + at). 


Both these solutions represent wave motions which are 
propagated with the velocity a along the z-axis; the first 
represents a wave travelling in the positive z-direction, the 
second a wave travelling in the negative z-direction. For 
let u have the value u(a, t) at any point a, at time 4; then uv 
has the same value at time ¢ at the point «= a, -+ a(t — 4). 
For then x—at=2,—at,, so that f(a — at) = f(a, — at). 
In the same way, we can see that the function g(x + at) repre- 
sents a wave travelling in the negative x-direction with velocity a. 

We shall now solve the following initial-value problem for 
this wave equation. From all possible solutions of the differential 
equation we wish to select those for which the initial state (at 
t = 0) is given by two prescribed functions u(x, 0) = (x) and 
u,(a, 0) = (xz). To solve this problem, we have merely to write 


u=f(e— at) -+ gle + at) 
and determine the functions f and g from the two equations 
P(e) = f(z) + 92) 
1 ya) = —f'@) + 9). 


The second equation gives 


0+ = [inde = — fla) + 910), 
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where c is an arbitrary constant of integration. From this we 
readily obtain the required solution in the form 


ua, t) = SE Ee Og eld. 


The reader should prove for himself, by introducing new 
variables £ = «— at, n= 2-+ at instead of w and ¢, that no 
solutions of the differential equation exist other than those given. 


2. The Wave Equation in Three-dimensional Space. 


In the wave equation for space of three dimensions the func- 
tion u depends on four independent variables, namely, the three 
space co-ordinates z, y, 2 and the time t. The wave equation is 
then 


1 
Una + Uyy + Use = 3 Uses 
a 
or, more briefly, 
1 
Au = a Usp 
Here again we can easily find solutions which represent the 
propagation of a plane wave in the physical sense. 
In fact, any function f(€), provided we assume that it is twice 


continuously differentiable, gives us a solution of the differential 
equation, if we make & a linear expression of the form 


&= on + By + yz tat, 
whose coefficients satisfy the relation 
a+ P+ = 1. 
Au= (+ P+ Y)f'(OY)=f"(é) 


Ue = @®f'"(E), 


we see that u == f(ax-+ By + yz - af) is really a solution of the 
equation 


For since 


and 


1 
Au = = Uy 
a 


If g is the distance of the point (x, y, z) from the plane 
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az + By + yz = 0, we know by analytical geometry (cf. Chap. I, 
p. 9) that 

q= on+ By + yz. 
Hence, in the first place, we see from the expression 

u=f(q + a) 

that at all points of a plane at a distance q from the plane 
oz -+ By + yz = 0 and parallel to it the property which is being 
propagated (represented by u) has the same value at a given 
moment. The property is propagated in space in such a way 
that planes parallel to ax+ By + yz=0 are always surfaces 
on which the property is constant; the velocity of propagation 
is a in the direction perpendicular to the planes. 

In theoretical physics a propagated phenomenon of this kind 
is referred to as a plane wave. 

A case of particular importance is that in which the property 
varies periodically with the time. If the frequency of the vibra- 
tion is w, a phenomenon of this kind may be represented by 

u= ethoxt Bot yalgint 


where k, as usual, denotes the reciprocal of the wave-length: 
a) 
k= ym K, 
a 
In the case of the wave equation with four independent 
variables we can find other solutions, which represent a spherical 
wave spreading out from a given point, say the origin. A spherical 
wave is defined by the statement that the property is the same 
at a given instant at every point of a sphere with its centre at 
the origin, that is, that u has the same value at every point of 
the sphere. To find solutions which satisfy this condition, we 
transform Au to polar co-ordinates (r, 0, ¢), and then we have 
merely to assume that u depends on r and ¢ only and not on 0 
and ¢. If we accordingly equate the derivatives of wu with 
respect to 6 and ¢ to zero (cf. p. 391), the differential equation 
becomes 
iis ie Fw 
r a 
or 


(rtt)er = 2 (ru) ss. 
a 
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If for the moment we replace ru by w, w is a solution of the 
equation 
1 


Wor = — Wits 
a 


which we have already discussed, and hence must be expressible 
in the form 


w = f(r — at) + g(r + at). 
Then 


w= > {f(r — at) + (r+ ai)}. 


The reader should now verify for himself directly that a function 
of this type is actually a solution of the differential equation 


Au = 5 Ue. 


Physically the function v= , f(r — at) represents a wave 


which is propagated outwards into space from a centre with 
velocity a. 


3. Maxwell’s Equations in Free Space. 


As a concluding example we shall discuss the system of equa- 
tions, known as Mazwell’s equations, which form the foundations 
of electrodynamics. We shall not, however, attempt to approach 
the equations from the physical point of view, but shall merely 
consider them as illustrating the various mathematical concepts 
developed above. 

The electromagnetic condition in free space is determined by 
two vectors, an electric vector & with components £,, E,, Es, 
and a magnetic vector H with components H,, H,, H;. These 
vectors satisfy Maxwell’s equations: 


curl & + bee 0, 
c ot 


ne ey 
c at 
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where c is the velocity of light in free space. Expressed in 
terms of the components of the vectors, the equations are: 


0B, OH, , 10H, 
T 


OF, OH, , 10H, | 

Oz on + c ot = 

oH, Of, , 10H, 

ox as c ot ” 
and 

oH, 0H, 1 OE, me 


OH, 0H, 108, _ 
ox oy. ¢ Ot 


For the components as functions of position and time we thus 
have a system of six partial differential equations of the first 
order, that is, of equations involving the first partial derivatives 
of the components with respect to the space co-ordinates and to 
the time. 

We shall now deduce some distinctive consequences of Max- 
well’s equations. If we form the “ divergence ” of both equations, 
and remember that div curl.4 = 0 and that the order of differen- 
tiation with respect to the time and formation of the divergence 
is interchangeable, we obtain 


div & = const., 
div H = const.; 


that is, the two “ divergences” are independent of the time. 
If we assume that the constants are initially zero, then they 
remain zero for all time. 
We now consider any closed surface S lying in the field and 
take the volume integrals 


[ f [ewBa 
f[ [ [civ Har 


and 
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throughout the volume enclosed by it. If we apply Gauss’s 
theorem (Chap. V, p. 388) to these integrals, they become integrals 
of the normal components E,, H, over the surface S. That is, 
the equations 


div £ = 0, 
div H = 0 
give 
f E, do = 0, 
8 
f H,do = 0. 
8 


In electrical theory surface integrals 


f Edo or f H,do 


are called the electric or magnetic flux across the surface 8, and 
our result may accordingly be stated as follows: 

The electric flux and the magnetic flux across a closed surface, 
subject to the assumptions we have made above, are zero. 

We obtain a further deduction from Maxwell’s equations if 
we consider a portion of surface S bounded by the curve I and 
lying in the surface. 

Tf we denote the components of a vector normal to the sur- 
face A by the suffix n, it immediately follows from Maxwell’s 
equations that 


_ 10H 
1E 7 
(ent = a 
(curl 7), = g ) 
c at 


If we integrate these equations over the surface with surface 
element do, we can transform the left-hand sides into line in- 
tegrals taken round the boundary I‘ by Stokes’s theorem (cf. 
Chap. V, p. 395). If we do this, and carry out the differentiation 
with respect to ¢ outside the integral sign, we obtain the equations 


fF as——15 ff H,,do, 
T 


[iuds= 425 [Bado, 
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where the symbols #, and H, under the integral signs on the left 
are the tangential components of the electric and magnetic 
vectors in the direction of increasing arc, and the sense of de- 
scription of the curve I’ in conjunction with the direction of the 
normal n forms a right-handed screw. 

The facts expressed by these equations may be expressed in 
words as follows: The line integral of the electric or the mag- 
netic force round an element of surface is proportional to the rate 
of change of the electric or magnetic flux across the element of 
surface, the constant of proportionality being —1/c or +1/e. 

Finally, we shall establish the connexion between Maxwell’s 
equations and the wave equation. We find, in fact, that each of 
the vectors © and A, that is, each component of the vectors, 
satisfies the wave equation 


1 
Au = a Usp. 
For we can eliminate the vector #, say, from the two equations, 
by differentiating the second equation with respect to the time 


and substituting for = from the first equation. 


It then follows that 


1ee 
leur] F + -—_— = 0. 
ce curl cur] & + o OB 0 


If we now use the vector relation * 
curl curl 4 = —AA + grad div A, 


and remember that 


div E= 0, 
we at once have 
1@e 
E=——_., 
2 ce at 


In the same way we can show that the vector 7 satisfies the 
same equation: 

_1e 

 @ OR 


AH 


* This vector relation follows immediately from the expressions in terms 
of co-ordinates. 
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EXAMPLES 
1. Integrate the following partial differential equations: 
(4) Uy, = 0; 
(b)  Ugyg = 05 
(C)  Ugy = a(x, y). 
2*, Solve the equation 
Ugg + 5tigy + Buy, = er +9 
by reducing it to one of the form of Ex. I(c). 


3. Find the partial differential equation satisfied by the two-parameter 
family of spheres 
2i= 1] — (x — a)? — (y — 5). 


4*, Let u(x, t) denote a solution of the wave equation 


1 
Une = Ui» (a > 0) 


which is twice continuously differentiable. Let (x) be a given function 
which is twice continuously differentiable and such that 


(0) = 9(0) = 90) = 0. 
Find the solution u for « 2 0 and t = 0 which is determined by the boun- 
dary conditions 
u(x, 0) = u,(z, 0) = 0 for z = 0, 
u(O, £) = o(f) for t = 0. 
5. Find a solution of the equation 
Uny = Uy 
for which u(z, 0) = u(0, y) = 1, in the form of a power series, 
6. (a) Find particular solutions of the equation 
; uf + uf = 
of the form u = f(x) + g(y). 
(6) Find particular solutions of the equation 
Ugh, = 1 
of the forms u = f(z) + g(y) and w= f(x) g(y). 
7*. Prove that if 


z= u(x, y, a, 6) 


17® (e912) 


490 DIFFERENTIAL EQUATIONS — [Cuar. VI 


is a solution, depending on two parameters a, 5, of the partial differential 
equation of the first order, 


F(2, Y, 2, 2g, Zy) = 0, 


then the envelope of every one-parameter family of solutions chosen 
from z = u(x, y, a, 6) is again a solution. 
8. Use this result to obtain other solutions of equation 6(b) by putting 
b= ka in v= art dy +b (where & is a constant). 
a 


CHAPTER VII 
Calculus of Variations 


1. InrRopuUcTION 


1, Statement of the Problem. 


In the theory of ordinary maxima and minima of a differenti- 
able function f(z, ... ,2,) of m independent variables, the necessary 
condition (p. 184) for the occurrence of an extreme value in a 
certain region of the independent variables is 


df=0 or gradf=0 or f,=0 (s=1,...,n). 


These equations express the stationary character of the function 
f at the point in question. The question whether these stationary 
points are actually maximum or minimum points can only be 
decided after further investigation. In contrast to the equations 
given above, the corresponding sufficient conditions take the 
form of inequalities. 

The calculus of variations is likewise concerned with the 
problem of extreme values (stationary values). Here, however, 
we have to deal with a completely new situation. For now the 
functions which are to have an extreme value no longer depend 
on one independent variable or a finite number of independent 
variables within a certain region, but are so-called functions of 
functions. That is, to determine them we require a knowledge of 
the behaviour of one or more functions or curves (or surfaces, 
as the case may be), the so-called “ argument functions ”. 

General attention was first drawn to problems of this type 
in 1696 by John Bernoulli’s statement of the brachistochrone 
problem. 

In a vertical xy-plane a point A(x, yy) is to be joined to a 
point Bix,, y,), such that x, > x, y, > Yo, by a smooth curve 


y = u(x) in such a way that the time taken by a particle sliding 
491 
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without friction from A to B along the curve under gravity 
(which is taken as acting in the direction of the positive y-axis) 
is as short as possible. 

The mathematical expression of the problem is based on the 
physical assumption that in such a curve y = ¢(z) the velocity 
ds/dt (s being the length of arc of the curve) is proportional to 
V 29(y — Yo), the square root of the height of fall. The time 
taken in the fall of the particle is therefore given by 


T={" dt ds 4 = s/t y) 
i ds de“ “75 = V(y— 4) 


(cf. Vol. I, pp. 299-301). If we drop the unimportant factor 
+/2g and take y) = 0 (which we can do without loss of generality), 
we have the following problem: 

Among all continuously differentiable functions y= ¢(2), 
y 2 0, for which (a) = 0, (#1) = y,, to find that for which 


the integral 
Oe er 


has the least possible value. 

On p. 505 we shall obtain the result, which was very sur- 
prising to Bernoulli’s contemporaries, that the curve y= ¢(z) 
must be a cycloid. Here we wish to emphasize that Bernoulli’s 
problem and the elementary problems of maxima and minima 
are absolutely different. The expression {¢} depends on the 
whole behaviour of the function ¢. It cannot be determined by 
stating the values of a finite number of independent variables, 
that is, it cannot be regarded as a function in the ordinary sense. 
We indicate its character of “function of a function d(x)” by 
means of curly brackets. 

The following is another problem of a similar nature: 

Two points A(z, yy) and B(x,, y,), where x, > %, 4 > 0, 
Y, > 0, are to be joined by a curve y= u(x) lying above the 
%-axis, in such a way that the area of the surface of revolution 
formed when the curve is rotated about the z-axis is as small 
as possible. 

Using the expression given on p. 274 for the area of a surface 
of revolution and dropping the unimportant factor 27, we have 
the following mathematical statement of the problem: 
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Among all continuously differentiable functions y= $(2) 
for which (2) = Yo. $(a%) = yy, $(z) > 0, to find that for which 
the integral 


1{8}—[ywa-+y%de y= se) 


has the least possible value. It will be found that the solution 
is a catenary. 

The elementary geometrical problem of finding the shortest 
curve joining two points A and B in the plane belongs in theory 
to the same category. Analytically, in fact, the problem is that 
of finding two functions 2(é), y(t) of a parameter ¢ in an interval 
t)StSt,, for which the values x(t) = 2%, 2(t)—=% and 
y(to) == Yo y(ty) = y, are prescribed, and for which the integral 


Be : _ de. di 
[verna («=F 5-%) 


has the least possible value. The solution is of course a straight 
line. 

On the other hand, the corresponding problem of finding the 
geodesics on a given surface G(x, y, 2) = 0, that is, of joining two 
points on the surface with co-ordinates (2p, Yo, 2) and (2%, Y;, 21) 
by the shortest possible line lying in the surface, unlike the 
problem of the shortest distance between two points in a plane, is 
not a trivial one. In analytical language this problem is as follows: 

Among all triads of functions a(t), y(é), 2(¢) of the parameter ¢ 
which make the equation 


G(x, y, 2) = 0, 


an identity in ¢, and for which 2(f)) = 2%, y(to) = Yo: 2(to) = 2%; 
w(t) = 2, y(t) = y,, 2(4) = 2%, to find that for which the integral 


[vV@+ e+ aya 


has the least possible value. 

The isoperimetric problem of finding a closed curve of given 
length enclosing the largest possible area, already discussed on 
p. 214, also belongs to the same category. We have proved 
above that the solution is a circle.* 


*The proof given there applied only to convex curves; the following 
remark, however, enables us to extend the result immediately to any curve: 
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The general statement of the simplest type of problems of the 
kind dealt with here is as follows: 

We are given a function F(z, ¢, 4’) of three arguments, which 
in the region of the arguments considered is continuous and has 
continuous derivatives of the first and second orders. If in this 
function F' we replace ¢ by a function y= (x) and ¢’ by the 
derivative y’ = ¢’(x), F becomes a function of x, and an integral 
of the form 


I{d} =[" Fe, yy’) da 


becomes a definite number depending on the behaviour of the 
function y = ¢(2), i.e. it is a “function of the function ¢(x) ”. 
The fundamental problem of the calculus of variations is now 
as follows: 

Among all the functions which are defined and continuous 
and possess continuous first and second derivatives in the interval 
% Sea, and for which the boundary values y= (2p) 
and y, = ¢(2,) are prescribed, to find that for which the integral 
I{¢} has the least possible value (or the greatest possible 
value). 

In discussing this problem the absolutely essential point is 
the nature of the “ conditions of admission” imposed on the 
functions ¢(z). The problem merely requires that when (2) is 
substituted F shall be a sectionally continuous function of z, 
and this is assured if the derivative ¢’(z) is sectionally continuous. 
But we have made the conditions of admission more stringent by 
requiring that the first derivatives, and even the second deri- 
vatives, of the functions ¢(x) shall be continuous. The field 
in which the maximum or minimum is to be sought is of course 
thereby restricted. It will, however, be found that this restriction 
does not, in fact, affect the solution, i.e. that the function which 
is most favourable when the wider field is available will always 


We consider the “‘ convex envelope” K of a curve C (cf. Ex. 2, p. 100), i.e. the con- 
vex curve of least area enclosing the interior of C. This curve K consists of convex 
arcs of C and rectilinear portions of tangents to C, which touch C at two points 
and bridge over concave parts of C by straight lines. It is evident that the area 
of K exceeds that of C, provided C is not convex, and, on the other hand, that 
the perimeter of XK is less than that of C. If we now make K expand uniformly, 
so that it always retains the same shape, until the resulting curve K’ has the 
prescribed perimeter, K’ will be a curve of the same perimeter as C, but en- 
closing a greater area. Hence in the isoperimetric problem we may from the 
outset confine ourselves to convex curves, in order to obtain the maximum area, 
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be found in the more restricted field of functions with con- 
tinuous first and second derivatives. 

Problems of this type occur very frequently in geometry and 
physics. Here we mention only one example. The fundamental 
principle of geometrical optics can be formulated as a variation 
problem of this type. If we consider a ray of light in the xy-plane 
and assume that the velocity of light is a given function v(z, y, y’) 
of the point (x, y) and of the direction y’ (y= ¢(x) being the 
equation of the light-path and y'= ¢/(z) the corresponding 
derivative), then Fermat’s principle of least time is as follows: 

The path of a ray of light between two given points A, Bis 
such that the time taken by the light in traversing it is less than 
the time which light would take to traverse any other path from 
A to B. 

In other words, if ¢ is the time and s the length of arc of any 
curve y= ¢(x) joining the points A and B, the time which 
light would take to traverse the portion of curve between 4 and 
B is given by the integral 


7 * dt ds = *4/(1 + y’) 
I{¢} =f edn =f O(a, ys ¥') aa 


To determine the actual path of the light we accordingly require 
to solve the problem of finding a function y= ¢(x) for which 
this integral has the least possible value. 

We see that the optical problem in this form is actually 
equivalent to the general problem stated above if we relate the 

3 
two functions F and v to one another by putting F = pave i) 
v 

In most optical cases the velocity of light » is independent of 
the direction and is merely a function of position, o(z, y). 


2. Necessary Conditions for Extreme Values. 


Our object is to find necessary conditions that a function 
u = ¢(z) may give a maximum or minimum, or, to use a general 
term, an extreme value, of the above integral I{¢}. Here we 
proceed by a method quite analogous to that used in the ele- 
mentary problem of finding the extreme values of a function of 
one or more variables. We assume that y= ¢= u(z) is the 
solution. Then we have to express the fact that (for a minimum) 
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I must increase when u is replaced by another admissible function 
¢. Here, moreover, as we are merely concerned with obtaining 
necessary conditions, we may confine ourselves to the considera- 
tion of functions ¢ which approximate to w, i.e. functions for 
which the absolute value of the difference ¢ — u remains between 
prescribed bounds. 

We think of the function u as a member of a one-parameter 
family with parameter ¢, constructed as follows. We take any 
function »(x) which vanishes on the boundary of the interval, 
ie. for which (29) = 0, (2) = 0, and which has continuous 
first and second derivatives everywhere in the closed interval. 
We then form the family of functions 


d(z, €) = u(x) + €n(z). 
The expression ¢7(x) = Su is called the variation of the function 
u. (Since 7(x) = 0¢/de, the symbol 8 denotes the differential 
obtained when ¢ is regarded as the independent variable and x 


as a parameter.) Then, if we regard the function u as well as the 
function 7 as fixed, 


I{u+ en} = P(e) = [Fe u-+ en, u’ + en’) dx 


is a function of e; and the postulate that u shall give a minimum 
of I{¢} implies that the function above shall possess a minimum 
for « = 0, so that as necessary conditions we have the equation 


©'(0) = 0 
and further the inequality 
(0) = 0. 


In the same way, if we were seeking a maximum, we should 
have the same equation ©’(0) = 0 and the inequality ®”(0) $0 
as necessary conditions. The condition ®’(0) = 0 must be satis- 
fied for every function 7 which satisfies the above conditions 
but is otherwise arbitrary. 

Putting aside the question of discrimination between maxima 
and minima, we say that if a function wu satisfies the equation 
@’(0) == 0, for all functions », the integral I is stationary for 
¢= u. If, as before, we use the symbol 6 to denote differentiation 
with respect to «, we may also say that the equation 


SI = <'(0) = 0, 


VIlj INTRODUCTION 497 


when satisfied by a function ¢ = u and an arbitrary 7, expresses 
the stationary character of J. The expression 
e= } 


d ™ 
<®’(0) = {¢ fo Fe, ut enw + nde 
deJx, 
is called the variation, or more accurately the first variation,* 
of the integral. Stationary character of an integral and vanishing 
of the first variation, therefore, mean exactly the same thing. 

Stationary character is necessary for the occurrence of maxima 
or minima, but, as in the case of ordinary maxima or minima, 
it is not a sufficient condition for the occurrence of either of 
these possibilities. Here we cannot go into the problem of suffi- 
cient conditions in more detail, and in what follows we confine 
ourselves to the problem of stationary character. 

Our main object is to transform the condition ®’(0) = 0 for 
the stationary character of the integral in such a way that it 
becomes a condition for u only and no longer contains the arbi- 
trary function 7. 


ExaMPLes 


1. In connexion with the brachistochrone problem (see pp. 491, 492), 
ealculate the time of fall when the points A and B are joined by a 
straight line. 

2. Let the velocity of a particle with polar co-ordinates (r, 0, 9) moving 
in three-dimensional space be v = 1/f(r). What time does the particle 
take to describe the portion of a curve given by a parameter o (the co- 
ordinates of a point on the curve being 7(c), 6(¢), 9(¢)) between the points 
A and B? 


2. Euter’s DirFERENTIAL EQuaTION IN THE SrmpLest CasE 


1. Deduction of Euler’s Differential Equation. 


The fundamental criterion of the calculus of variations is as 
follows: 
The necessary and sufficient condition that the integral 


Ig}= [Fe $, #)de 


* From this comes the use of the term calculus of variations, which is meant 
to indicate that in this subject we are concerned with the behaviour of functions 
of a function when this independent function or “ argument function ” is made 
to vary by altering a parameter e. 
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shall be stationary when 6 = u is that ¢ = u shall be an admissible 
function satisfying Euler’s differential equation* 


d 
Lu] = F,-— — Fy, =0 
(eles He ee, 
or, in full, 
Bye + Bw + Bay — P=. 


To prove this we note that we can differentiate the expression 
D(e) = f “Flo, ut en, u’ + en!) de 


with respect to « under the integral sign (cf. Chap. IV, § 1, 
p. 218), provided that the differentiation gives rise to a con- 
tinuous function, or at least a sectionally continuous function of 
x, under the integral sign. In this case, on putting u-+- en = y 
and differentiating, we obtain under the integral sign the 
expression 7/, + 7'F,,, which, owing to the assumptions made 
about f, u, and 7, satisfies the conditions just stated. Hence 
we immediately obtain 


G0) =f" OF +1 Fyjde=0 (Fle, wu). 


For subsequent purposes (see the next page), we note that in 
the formation of this equation we have used nothing beyond the 
continuity of the functions wu and y and the sectional continuity 
of their first derivatives. In this equation the arbitrary function 
appears under the integral sign in a twofold form, namely, as 
ny and y’. Wecan, however, immediately get rid of 4’ by integra- 
tion by parts; we have 


f va! Fda =F, i f "WE Fy) dz = — f “(Z Fe) dz, 


for by hypothesis 7(z9) and (2,) vanish. In this integration by 


parts we have to assume that the expression 7 F,, can be formed, 


but this assumption certainly holds good, for we began by assum- 
ing continuity of the second derivatives. Hence, if we write 


d 
Lu] = F,— 5 Pe 


* The terms principal equation, characteristic equation are also used. 
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for brevity, we have the equation 


[Hulda = 0. 
Xo 
Now this equation must be satisfied for every function » which 
satisfies our conditions but is otherwise arbitrary. We thus 
conclude that 
L[u] = 90, 

in virtue of the following 

Lemma 1.—If a function C(z) which is continuous in the 
interval under consideration satisfies the relation 


['re\C@)ae = 0, 


where 7(z) is any function such that 7(%) = 7(x,) = 0 and 
n’(x) is continuous, then C(x) = 0 for every value of » in the 
interval. The proof of this lemma will be postponed to the next 
sub-section (p. 501). 


We could, however, obtain our condition in a different way,* by 
getting rid of the term in 7 in the equation 


My 
(nFy + Py) dx =0 
Xe 
by integration by parts. For if we write Fy = A, F,=b= B’ for 
brevity and remember the boundary condition for 4, on integrating by 
parts we obtain 


xy oy My 
f qi ,dx = if 7B’da=— | 'Bdx. 
Xo Xe 


Xo 


If we put ¢ = 7, we have the condition 


“¢(4 — B)dx = 0. 


Xo 


In this method we need not make any further assumptions about 
the second derivatives of 7 and uw. On the contrary, it is sufficient to 
assume that 9 (or ) and 7 are continuous and have sectionally continuous 
first. derivatives. Now our equation must hold, not, it is true, for any 
arbitrary (sectionally continuous) function ¢t, but only for those functions ¢ 
which are derivatives of a function »(z) satisfying our conditions. If, 


* The first method is due to Lagrange, the second to P. Du Bois Reymond. 
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however, ¢(x) is any given sectionally continuous function satisfying the 
relation 


Ey 
f C(x) da = 0 
Xo 
and otherwise arbitrary, we can pub 
x 
a= f todas 
‘Xo 


we have then constructed an admissible », for y’ = C and (2) = v(x) = 0. 
We thus obtain the following result: 
A necessary condition that the integral should be stationary is 


“(A — B)dx = 0, 


*o 


where ¢ is an arbitrary sectionally continuous function merely satisfying 


the condition 
xy 
f Uda = 0. 
Xo 
We now require the help of 
Lemma I1.—If a sectionally continuous function S(x) satisfies the 
condition 
ey 
f (Sde = 0, 
%o 
for all functions ¢(x) which are sectionally continuous in the interval and 
for which 
oN 
Cdx = 0, 
Xo 
then S(x) is a constant c. 
This lemma will also be proved in the next sub-section (p. 501). If 
meanwhile we assume its truth, it follows, if we substitute the above 
expressions for 4 and B, that 


* 
nA Fide te= Fy. 
Xe 
The left-hand side regarded as an indefinite integral may be differentiated 


with respect to and has F,, as its derivative; the same is therefore true 


d 
of the right-hand side. Hence the expression dni for the supposed 
solution u exists, and the equation 


holds. 
Thus Euler’s equation still remains the necessary condition for an 
extreme value, or the condition that the integral should be stationary, 
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when the class of admissible functions (x) is extended from the outset by 
requiring only sectional continuity of the first derivative of 9(z). 


Euler’s equation is an ordinary differential equation of the 
second order. Its solutions are called the extremals of the minimum 
problem. To solve the minimum problem we have, among all 
the extremals, to find one which satisfies the prescribed boundary 
conditions. If “ Legendre’s condition ” 


Fy ul = 0 
is satisfied for f = u(x), the differential equation can be brought 


into the “regular” form w= f(2, u, u’), where the right-hand 
side is a known expression involving 2, u, u’. 


2. Proofs of the Lemmas. 

We have now to prove the two lemmas used above. 

To prove Lemma I, we assume that at some point, say 7 = é, 
C(x) is not zero, say positive. Then in virtue of the continuity 
of C(x) we can certainly mark off a sub-interval 


é—-as¢tSé+a 


within the complete interval in such a way that C(x) remains 
positive everywhere in the gub-interval. We now define y a8 
given in this sub-interval by 


ala) = (w— E+ a)(a — €— a) = {(w@— 4° — a} 
and elsewhere as zero. This function 7 certainly fulfils all the 
prescribed conditions; 7(x)C(z) is positive inside the sub-interval, 
and zero outside it. The integral f "Cd therefore cannot be 


Xe 
zero.* Since this contradicts our hypothesis, C(é) cannot be 
positive. For the same reasons, C(£) cannot be negative. Hence 
C(é) must vanish for all values of ¢ within the interval, as was 
stated in the lemma. 


To prove Lemma II, we note that our assumption about C(x) im- 
mediately leads to the relations 


“¢(a)dz = 0 and f “¢(z\{8(a) — o}dx = 0, 


* The integral of a continuous non-negative function is positive, except 
when the integrand vanishes everywhere; this follows immediately from the 
definition of integral. 
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where c is an arbitrary constant. We now choose ¢ in such a way that 
S(x)— c is an admissible function ((x), that is, we determine c by the 
equation 


o= if tedae if “(8(e) — o}de = f wG(elde = ol — wah 


Substituting this value of c in the above equation and taking = S(x) — c, 
we at once have 


i *“(8(2) — c}dx = 0. 
Xo 


Since by hypothesis the integrand is continuous, or at least sectionally 
continuous, it follows that 

S(z) -—ce=0 
is an identity in z, as was stated in the lemma, 


3. Solution of Euler’s Differential Equation. Examples. 


To find the solutions wu of the minimum problem we have 
(p. 497) to find a particular solution of Euler’s differential equation 
for the interval zy<2x< 2, which assumes the prescribed 
boundary values y, and y, at the end-points. As the complete 
integral of Euler’s differential equation of the second order 
contains two constants of integration, it is generally possible to 
make these two constants fit the boundary conditions, the latter 
giving two equations which the constants of integration must 
satisfy. 

In general it is not possible to solve Euler’s differential 
equation explicitly in terms of elementary functions or quadra- 
tures. In the general case we have to be content to establish 
the fact that the variation problem does reduce to a problem in 
differential equations. On the other hand, in important special 
cases and, in fact, in most of the classical examples, the equation 
can be solved by means of quadratures. 

The first case is that in which F does not contain the deri- 
vative y’ = ¢’ explicitly: F = F(¢, x). Here Euler’s differential 
equation is simply F,(u, x)= 0; that is, it is no longer a 
differential equation at all but forms an implicit definition of 
the solution y= u(x). Here of course there is no question of 
integration constants or the possibility of satisfying boundary 
conditions. 

The second important special case is that in which F does 
not contain the function y = ¢(z) explicitly: F = F(y’, x). Here 


VI] EULER’S DIFFERENTIAL EQUATION 503 
Euler’s differential equation is 7 (F.,) = 0, which at once gives 


Fy =, 


where c is an arbitrary constant of integration. We may use 
this equation to express u’ as a function, f(#, ¢), of # and c, and 
we then have the equation 


wu = f(z, c), 


from which by a simple integration (quadrature) we obtain 
u=[fléodé+a, 
0 


that is, u is expressed as a function of « and c, together with an 
additional arbitrary constant of integration a. In this case, 
therefore, Euler’s differential equation can be completely solved 
by quadrature. 

The third case, which is the most important in examples and 
applications, is that in which F does not contain the independent 
variable x explicitly: F = F(y, y’). In this case we have the 
following important theorem: 

If the independent variable x does not occur explicitly in the 
variation problem, then 


E= Flu, w)—wFu,wv)=e 


is an integral of Euler’s differential equation. That ts, if we sub- 
stitute in this expression a solution u(x) of Euler’s differential 
equation for ¥', the expression becomes a constant independent of x. 
The truth of this statement follows at once if we form the 

derivative dE/dx. We have 

cee Fy + Pyul! — ul Py — By — UU Py, 

dz 
or 


casi u'L[u] = 0; 
dz 


hence for every solution u of Euler’s differential equation we have 
E = ¢, where c is a constant. 
If we think of u’ as calculated from the equation H=c, 
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say u’ = f(u, c), a simple quadrature applied to the equation 


dz sl 

du f(u, 0) 
gives x = 9(u, c) +- a (where a is another constant of integration), 
Le. x is expressed as a function of u, c, and a, and by solving for w 
we obtain the function u(x, c, a). Hence the general solution 
of Kuler’s differential equation, depending on two arbitrary 
constants of integration, is obtained by a quadrature. 


We shall now use these methods to discuss a number of 
examples. 


1. General Note—There is a general class of examples in which F is 
of the form F = g(y)V(1+ y’), where g(y) is a function depending 
explicitly on y only. For the extremals y == u our last rule at once gives 

gu) V (1 + wu’) — g(u)u’?/V(1 + u’8) = 6 
or 
9) 
V(1 + uw) 


Ut a 1, 


1 
Viguylet = 1 
and on integrating we have the equation 
du 


*— b= (Rea 


where 6 is another constant of integration. By evaluating the integral on 
the right and imagining the equation solved for u, we obtain wv as a function 
of x and of the two constants of integration c and b. 


2. The Surface of Revolution of Least Area.—tIn this case g=y. The 
integral given above becomes 


z2— b= f= car cosh ”; 
Viaje =] c 
hence the result is 


Oe 
du 


(seis boot! 


That is, the solution of the problem of finding a curve which on rotation 
gives a surface of revolution with stationary area is a catenary. 
A necessary condition for the occurrence of such a stationary curve is 
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that the two given points A and B can be joined by a catenary for which 
y > 0. The question whether the catenary really represents a minimum 
cannot be discussed here. 


3. The Brachistochrone.—Another example is obtained by taking 
g=1/vy. This is the problem of the brachistochrone. By means of the 
substitutions 1/e? = k, w= kt, 7 = sin? /2 the integral 


du 
f V1 f(uct) — 1 


is immediately transformed into 


ree if [(.)e= yk [1 — 080)a0, 


2 — b= 110 — ain®), 


whence 


= u= 4k(1 — cos8). 


The brachistochrone is accordingly (cf. Vol. I, p. 261) a common cycloid 
with its cusps on the z-axis. 


EXAMPLES 
Find the extremals for the following integrands: 


LFHeVyity®. 2% F=Vit+ yy. 3. F=yV1— y®. 


4. Find the extremals for the integrand F = afy’?, and prove that if 
n = 1 two points lying on opposite sides of the y-axis cannot be joined by 
an extremal. 

5. Find the extremals for the integrand y"y"™, where » and m are even 
integers. 

6. Find the extremals for the integrand F = ay’? + 2byy’ + cy*, 
where a, b, ¢ are given continuously differentiable functions of 2. Prove 
that Euler’s differential equation is a linear differential equation of the 
second order. Why is it that when 6 is constant this constant does not 
enter into the differential equation at all? 

7. Show that the extremals for the integrand F= eV/1+ y” are 
given by the equations sin(y — b) = e—@—@ and y= 6, where a, b are 
constants. Discuss the form of these curves, and investigate how the 
two points A and B must be situated if they can be joined by an extremal 
arc of the form y = f(z). 

8. For the case where F does not contain the derivative y’, deduce 
Euler’s condition F, = 0 by an elementary method. 


9. Find a function giving the absolute minimum of 


Iy)= fo ytd 
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with the boundary conditions 
(2) (0) = y(1) = 0; 
(+) x0) =9, YI)=1L 


4. Identica) Vanishing of Euler’s Expression. 


Huler’s differential equation for F(z, y, y’) may degenerate 
into an identity which tells us nothing, i.e. into a relation which 
is satisfied by every admissible function y = ¢(z). In other words, 
the corresponding integral may be stationary for any admissible 
function y = ¢(x). If this degenerate case is to occur, Kuler’s 
expression F, — Fry — Fyyy’ — Fyyy"” must vanish at every 
point x of the interval, no matter what function y= ¢(z) is 
substituted in it. We can, however, always find a curve for 
which y= ¢, y’ = ¢’, and y” = ¢” have arbitrary prescribed 
values at a definite point. LEuler’s expression must therefore 
vanish for every quartet of numbers a, y, y’, y’’. We conclude that 
the coefficient of y”, ie. F,,, must vanish identically. F must 
therefore be a linear function of y’, say F = ay’ + b, where a and 
b are functions of z and y only. If we substitute this in the 
remaining part of the differential equation, 


Fyyy ++ Puy — F,= 0, 
it follows at once that 


, , 
a,y’ + a, — ayy’ — by, 
or 
ay — by, 


must vanish identically in 2 and y. In other words, Euler’s ex- 
pression vanishes identically if, and only if, the integral is of the 
form 


I = [{ae, yyy’ + B(a, y)} da = fady + bdz, 


where a and b satisfy the condition of integrability which we 
have already met with in Chap. V, § 1 (p. 353), that is, where 
ady + bdz is a perfect differential. 
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3. GENERALIZATIONS 


1. Integrals with More than one Argument Function. 


The problem of finding the extreme values (stationary values) 
of an integral can be extended to the case where this integral 
depends not on a single argument function but on a number 
of argument functions ¢,(z), $(x), ..-, n(x). The typical problem 
of this type may be formulated as follows: 

Let F(x, ¢, .--> dns $y, --+» On) be a function of the 
(2n + 1) arguments 2, ¢,,..-., dn, Which is continuous and has 
continuous derivatives up to and including the second order in the 
interval under consideration. If we replace y, = ¢, by a function 
of x with continuous first and second derivatives, and ¢,' by its 
derivative, F becomes a function of the single variable 2, and the 
integral 


I{4y, tees b}=[ Fe, Gy, +++ Pn fy» sees Pn )da 


over a given interval 2 S xz < a, has a definite value determined 
by the choice of these functions. 

In the comparison we regard all functions ¢,(x) as admissible 
which satisfy the above continuity conditions and for which the 
boundary values ¢,(x) and ¢,(z,) have prescribed fixed values. 
In other words, we consider the curves y; = ¢,(x) joining two 
given points A and B in (n + 1)-dimensional space in which the 
co-ordinates are ¥,, Ya, ---» Yn, Z The variation problem now 
requires us to find, among all these systems of functions 
(xz), one (y;= $2) = ux)) for which the above integral 
I{$,, .. +, dn} has an extreme value (a maximum or a 
minimum). 

Here again we cannot discuss the actual nature of the 
extreme value, but shall confine ourselves to inquiring for 
what systems of argument functions ¢,(z) = u,(x) the integral is 
stationary. 

We define the concept of stationary value in exactly the same 
way as we did in § 1 (p. 496). We include the system of functions 
ux) in a one-parameter family of functions depending on the 
parameter «, in the following way. Let 7,(2), ..., a(x) be » arbi- 
trarily chosen functions which vanish for x = 2 and x = a, are 
continuous in the interval, and possess continuous first and 


508 CALCULUS OF VARIATIONS [CHap. 


second derivatives there. Then we consider the family of functions 
Yi = $x) = u(x) + en,(a). 

The term en,(z) = Su, is called the variation of the function 
u;. If we substitute the expressions for 4, in [{4,,..., dn}, this 
integral is transformed into 


D(e) =f Fe, Uy €N, 22) Unt Nn; Uy! + en’; waey Un’ en, ) da, 


which is a function of the parameter «. A necessary condition 
that there may be an extreme value for ¢, = u,, i.e. for «= 0, 
is 

©'(0) = 0. 


Just as in § 1, p. 496, we say that if the equation ©’(0) = 0 
holds, or, as we may also say, if the equation 


SI = «®’(0) = 0 


holds, no matter how the functions 4, are chosen subject to the 
conditions stated above, the integral J has a stationary value for 
¢:= u, In other words, stationary character of the integral for 
a fixed system of functions u,(z) and vanishing of the first varia- 
tion 82 mean the same thing. 

We have still the problem of setting up conditions for the 
stationary character of the integral which no longer contain the 
arbitrary variations 7; To do this we do not require any new 
ideas, but proceed as follows. If we take 7g, 3, ..., %, a8 identi- 
cally zero, i.e. if we do not let the functions wp, ..., u, vary, 
and thus consider the first function ¢,(z) as alone variable, the 
condition ©’(0) = 0, by § 2, p. 498, is equivalent to Euler’s 
differential equation 

d 
F,, z Fy. = 0. 
As we can pick out any one of the functions u,(x) in the same way, 
we obtain the following result: 

A necessary and sufficient condition that the integral 
T{u,, uy, ..., u,} may be stationary is that the n functions u,(x) 
shall satisfy the system of Euler’s equations 

a 


Fu ~ 5 Puy = 0 (@= 1, 2,..., 7). 
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This is a system of differential equations of the second order, 
n in number, for the n functions u,(z). All solutions of this 
system of differential equations are said to be extremals of the 
variation problem. Thus the problem of finding stationary values 
of the integral reduces to the problem of solving these differential 
equations and adapting the general solution to the given boundary 
conditions.* 


2. Examples. 


The possibility of giving a general solution of the system of 
Euler’s differential equations is even more remote than in the 
case in § 2. It is only in very special cases that we can find all the 
extremals explicitly. Here the following theorem, analogous to 
the particular case on p. 503, is often useful: 

If the function F does not contain the independent variable 
x explicitly, F = F(¢,,..-, bn b's +++ $n), then the expression 


E= Fw, veey Un Uy, oe. > Un’) — Lu Fy 
i=1 
is an integral of Euler’s system of differential equations. That 
is, if we consider a system of solutions ux) of Euler’s system 
of differential equations, we have for this solution 


E= F — Xu/F,, = const. = ¢, 


where, of course, the value of this constant depends upon the 
system of solutions which is substituted. 

The proof follows the same lines as in § 2 (p. 503); we differen- 
tiate the left-hand side of our expression with respect to x and, 
using Euler’s differential equations, verify that the result is 
zero. 


A trivial example is the problem of finding the shortest distance between 


* Using Lemma II (§ 2, p. 500), we can prove that these differential equations 
must hold, under the general assumption that the admissible functions need 
only have sectionally continuous first derivatives. For the beginner who wishes 
to concentrate on the essential mechanism of the subject, however, it is more 
convenient to include continuity of the second derivatives in the conditions of 
admissibility of the functions ¢,(z). We can then work out the expressions 


2 F,,/ and write them in the more explicit form 


n n 
2D Pay'uyte” + 2D Puate + Pou 
Ne Uy! + ° 
kewl 5 kml shod 4 
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two points in three-dimensional space. Here we have to determine two 
functions y = y(x), z= 2(x) such that the integral 


[Vast e+ ede 
Xo 


has the least possible value, the values of y(x) and 2(x) at the end-points 
of the interval being prescribed. Euler’s differential equations give 
y’ _d 2! =0 
(I+ y+ 2%) deV(lty%+2%) 
whence it follows at once that the derivatives y’(x) and z’(x) are constant; 
hence the extremals must be straight lines. 
Somewhat less trivial is the problem of the brachistochrone in three 
dimensions. (Gravity is again taken as acting along the positive y-axis.) 
Here we have to determine y = y(x), z= 2(”) in such a way that the 


integral 
xy 1} +] My 
ih a (= ae = f Fly, y’, 2’) dx 
ty y %o 


is stationary. Euler’s differential equations give 
z 1 ay 
Vy Vl + y+ 2) 

ss eee ey 

Vy Vil es y + 2’2) 7 

where a@ and 6 are constants. By division it follows that 2’ = a/b = k 


is likewise constant. The curve for which the integral is stationary must 
therefore lie in a plane z = kx -++ h. From the further equation 


1 1 =% 
Vy Vit B+ yy) 
there follows the fact, obvious from § 2 (p. 505), that this curve must again 
be a cycloid, 


a, 


F— y Fy — vFy = 


EXAMPLE 


Write down the differential equations for the path of a ray of light 
in three dimensions in the case where (polar co-ordinates r, 0, 9 being 
used) the velocity of light is a function of r (cf. § 1, Ex. 2, p. 497). Show 
that the rays are plane curves. 


3. Hamilton’s Principle. Lagrange’s Equations. 


Huler’s system of differential equations has a very impor- 
tant bearing on many branches of applied mathematics, especially 
dynamics. For the motion of a mechanical system consisting 
of a finite number of heavy particles can be expressed by the 
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condition that a certain expression, the so-called Hamilton’s 
integral, is stationary. Here weshall briefly explain this connexion. 

A mechanical system has n degrees of freedom if its position 
is determined by nm independent co-ordinates 91, Jo, +--+» Gn 
If, for example, the system consists of a single particle, n = 3, 
since for 4), qa) gj we can take the three rectangular co-ordinates 
or the three polar co-ordinates. Again, if the system consists of 
two particles which are held at unit distance apart by a rigid 
connexion—assumed to have no mass—then n= 5, since for 
the co-ordinates g; we can take the three rectangular co-ordinates 
of one particle and two other co-ordinates determining the 
direction of the line joining the two particles. 

A dynamical system can be described with sufficient generality 
by means of two functions, the kinetic energy and the potential 
energy. If we think of the system as moving in any way, the 
co-ordinates q, will be functions q(t) of the time ¢, the “com- 
ponents of velocity ” being ¢;= dq,/dt. Then associated with 
the dynamical system there is a function which we call the 
kinetic energy and which is of the form 


T(h, ae] In da ae) Gn) =3 x Gude (Aix a yg). 
i,k=1 


The kinetic energy, therefore, is a homogeneous quadratic ex- 
pression in the components of velocity, the coefficients a,;, being 
taken as known functions, not depending explicitly on the time, 
of the co-ordinates 9,, ..-, %n themselves.* 

In addition to the kinetic energy, the dynamical system 
is supposed to be characterized by another function, the poten- 
tial energy U(q,, . . - » Gn), Which depends on the co-ordinates of 
position g; only and not on the velocities or the time. 

Now Hamilton’s principle is as follows: the actual motion of 


* We obtain this expression for the kinetic energy T by thinking of the 
individual Rostenen ty co-ordinates of the particles of the system as expressed 
as functions of the co-ordinates g,,.--, %- Then the rectangular velocity 
components of the individual particles can be expressed as linear homogeneous 
functions of the @,’s, and finally the elementary expression for the kinetic 
energy is formed, namely, half the sum of the products of the individual masses 
and the squares of the corresponding velocities. 

{ As is shown in dynamical textbooks, this potential energy determines the 
external forces acting on the system. In bringing the system from one position 
into another mechanical work is done; this is equal to the difference between 
the corresponding values of U and does not depend on the path by which the 
transference from one position to another takes place. 
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a dynamical system in the interval of time i Sti, from a 
given initial position to a given final position is such that for this 
motion the integral 


Hf, «+5 In} =e —U)dt 


is stationary, if in the comparison we include all continuous 
functions g,(¢) which have continuous derivatives up to and includ- 
ing the second order and which for t=) and t= ¢, have the 
prescribed boundary values. 

This principle of Hamilton’s is a fundamental principle of 
dynamics. The advantage of it is that it forms a brief summary 
of the laws of dynamics. When applied to Hamilton’s principle, 
the general theory of this chapter gives Lagrange’s equations, 


aa a@__ ou 
di 0g, 0g; 04: 
which are the fundamental equations of higher dynamics. 

Here we shall merely make one noteworthy deduction, namely, 
the law of the conservation of energy. 

Since the integrand in Hamilton’s integral does not depend 
explicitly on the independent variable ¢, the solution q,(¢) of the 
differential equations of dynamics must be such as to make the 
expression 


. (@=1, 2,...,n) 


at — UV) 

Og: 
constant. Since U does not depend on the q,’s and T is a homo- 
geneous quadratic function in them (cf. p. 109), 


E=T~—U— 34, 


, (7 — 0) , or 
>» —— = Lig; — = 27. 
ze Og; : 0g; 
Hence 
T + U = const.; 


that is, during the motion the sum of the kinetic energy and the 
potential energy does not vary with the time. 


4. Integrals Involving Higher Derivatives. 

Methods analogous to those used in the examples discussed 
previously can be used to attack the problem of the extreme 
values of integrals in which the integrand F not only contains 
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the required function y= ¢ and its derivative ¢’, but also in- 
volves higher derivatives, e.g. the second. For example, suppose 
we wish to find the extreme values of an integral of the form 


1$}=f “Fe, $$, "dz, 


where in the comparison those functions y = ¢(z) are admissible 
which, together with their first derivatives, have prescribed 
values at the end-points of the interval, and which also have 
continuous derivatives up to and including the fourth order. 

To find necessary conditions for an extreme value we again 
assume that y= u(x) is the desired function. We then include 
it in a family of functions y= (x) = u(x) + en(z), where ¢€ is 
an arbitrary parameter and (2) an arbitrarily-chosen function 
with continuous derivatives up to and including the fourth order, 
which together with its derivatives vanishes at the end-points. 
The integral then takes the form ®(e), and the necessary condition 


0'(0) = 0 


must be satisfied for all these functions (x). Proceeding in a 
way analogous to that in § 2 (p. 498), we differentiate under the 
integral sign and thus obtain the above condition in the form 


[ OFi+ Fe + 1 Pu)de = 0, 
Xe 


which must be satisfied if u is substituted for ¢(x). Integrating 
once by parts we reduce the term in 7’(x) to one in 7, and integrat- 
ing twice by parts we reduce the term in 7’"(x) to one in y; taking 
the boundary conditions into account, we easily obtain 


t d a 
—— F,+ — Ff, )dx= 0. 
l. (Fe ek + Fy) c= 0 


"Hence the necessary condition for an extreme value, i.e. that the 
integral may be stationary, is Huler’s differential equation 
d a 
L =F,— —Fy —. F,.= 0. 
Ul da” * ae 
The reader can verify for himself that this is a differential equa- 


tion of the fourth order. 
18 (E912) 


514 CALCULUS OF VARIATIONS [CuaP 


EXAMPLE 
Consider 


r= {(e"* — 2fede, 


where f(x) is a given function. Here Euler's differential equation is 
ul? — f(x) = 0. 


5. Several Independent Variables. 


The general method for finding necessary conditions for an 
extreme value can equally well be applied when the integral 
is no longer a simple integral but a multiple integral. Let 
D be a given region bounded by a sectionally smooth curve [ 
in the ay-plane. Let F(a, y, , de, ¢y) be a function which is 
continuous and twice continuously differentiable with respect to 
all five of its arguments. Ifin F we substitute for ¢ a function 
d(x, y), which has continuous derivatives up to and including 
the second order in the region D and has prescribed boundary 
values on [, and if we replace ¢, and ¢, by the partial deriva- 
tives of ¢, F becomes a function of x and y, and the integral 


1{$}= f [Ft y, $, bo b)dedy 


has a value depending on the choice of ¢. The problem is that 
of finding a function ¢= u(x, y) for which this value is an 
extreme value. 

To find necessary conditions we again use the old method. 
We choose a function 7(z, y) which vanishes on the boundary I, 
has continuous derivatives up to and including the second order, 
and is otherwise arbitrary; we assume that u is the required 
function and then substitute ¢ = u-+ ey in the integral, where 
eis an arbitrary parameter. The integral again becomes a function 
@(e) and a necessary condition for an extreme value is 


©'(0) = 0. 
As before, this condition takes the form 
J faut nF e+ Fs) dady = 0. 


To get rid of the terms in 7, and 7, under the integral sign we 
regard the double integral as a repeated integral, and integrate 
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one term by parts with respect to x and the other with respect 
to y. Since y vanishes on I’, the boundary values on I fall out, 
and we have 


[Srl =. or, = 5 F,) dedy = 0. 


Lemma I of § 2 (p. 499) can be extended at once to more 
dimensions than one, and we immediately obtain Huler’s partial 
differential equation of the second order, 


rf) ] 
— — Ff, — Ff, = 90. 
f,, on Uy oy Uy 
EXAMPLES 


1. F= 9,27+ 9,3. If we omit the factor 2, Huler’s differential equation 
becomes 
AU = Ugey + Uyy = 0. 
That is, Laplace’s equation has been obtained from a variation problem. 


2. Minimal Surfaces. Plateaws Problem.—To find a surface z = f(x, y) 
over the region D, which passes through a prescribed curve in space whose 
projection is T, and whose area 


{f-va + oa + 9,7) dady 


is a minimum. 
Here Euler’s differential equation is 


o Uy a Q U, 
a2 4/(1 + ug? + uy*) 
or, in expanded form, 


Waal + thy?) — Let pyteathy + Uyy(1 + u,*) = 0. 


y 


dy J + ue + u) 


> 


This is the celebrated differential equation of minimal surfaces, which we 
cannot discuss further here. 


6. Problems Involving Subsidiary Conditions. Euler’s Multiplier. 


In discussing the theory of ordinary extreme values of func- 
tions of several variables in Chapter III, § 6 (p. 191) we con- 
sidered the case where these variables are subject to certain 
subsidiary conditions. In this case the method of undeter- 
mined multipliers led to a particularly clear expression for the 
conditions that the function may have a stationary value. An 
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analogous method is of even greater importance in the calculus of 
variations. Here we shall briefly discuss the simplest cases only. 

(a) Ordinary Subsidiary Conditions—As a typical case we 
consider that of finding a curve z= 2(t), y= y(t), 2= 2) 
(¢ StS) in three-dimensional space, expressed in terms of 
the parameter ¢, subject to the subsidiary condition that the 
curve shall lie on a given surface G(x, y, z) = 0 and shall pass 
through two given points A and B on that surface. What we 
have to do, then, is to make an integral of the form 


by 
if F(z, y, 2, & y, 2) dt 
te 


stationary by suitable choice of the functions x(t), y(t), 2(t), 
subject to the subsidiary condition G(a, y, z) = 0 and the usual 
boundary conditions and ccntinuity conditions. 

This problem can be immediately reduced to the cases dis- 
cussed in sub-section 1 (p. 507). We assume that z(t), y(t), 2(¢) 
are the required functions. We assume further that on the 
portion of surface on which the required curve is to lie z can be 
expressed in the form z= g(x, y). This is certainly the case if 
G, differs from zero on this portion of the surface. If we assume 
that on the surface in question the three equations G, = 0, 
G, = 0, #,=0 are not simultaneously true and confine our- 
selves to a sufficiently small portion of surface, we can suppose 
without loss of generality that G,=:0. If we then substitute 
z= g(x, y) and ¢=g,%-+9,y under the integral sign, the 
problem becomes one in which x(¢) and y(t) are functions inde- 
pendent of one another. Thus we can immediately apply the 
result of sub-section 1 (p. 508) and write down the conditions 
that the integral J may be stationary, by applying the aforesaid 
result to the integrand 


F(a, Y, G(X, Y¥), BY, Ea + YGy) = H(z, y, &, y). 


We then have the two equations 


d d d 0% 
que Hem oP Pet qe (Pega) — Pega Fe = 0, 
d d d 04 
gti Hem 5 Bet 5 Fins) — Fate —Fig = 0. 
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But d 0% d 0% 
9" da a By’ 


as we see at once on differentiation. Hence we have 


d d 
4 7,— Fat go(4 Fi— Fe) =O 
d d 
qr By+ 95 Fs B,) = 0. 
If for brevity we write 
d 
are Fu= AG, . +--+. (A) 
that is, if we introduce a multiplier A(t), and use the facts that 
Je = —Gx/F. Jy = —G,/G,, we obtain the two further equations 
d 
a Fe F,== Ge + ee ee (A) 


oF, — F,= 6, tee (A) 

We thus have the following condition that the integral may 
be stationary: 

If we assume that G,, G,, G, do not all vanish simultaneously 
on the surface G= 0, the necessary condition for an extreme 
value is the existence of a multiplier A(t) such that the three 
equations (A) given above are simultaneously satisfied in addition 
to the subsidiary condition G(«, y, z)=0. That is, we have 
four symmetrical equations determining the functions <(t), y(t), 
2() and the multiplier A. 

The most important special case of this is the problem of 
finding the shortest line joining two points A and B on a given 
surface G = 0, on which it is assumed that the gradient gradG 
does not vanish. Here 


F=y@+ P+, 

and Euler’s differential equations are 
(ee Lae ee | 

di \/ (a? + yf? + 2) 


Gas 
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d i 
eae = dG 
aves pre“ 
d z 
Se AG, 
aJe+ P+?) 


These equations are invariant with respect to the introduction 
of a new parameter t. That is, as the reader may easily verify 
for himself, they retain the same form if ¢ is replaced by any other 
parameter += 7(t), provided that the transformation is one-to- 
one, reversible, and continuously differentiable. If we take the 
arc as the new parameter, in other words, if we assume that after 
the introduction of the new parameter 22+ 47+ #—= 1, our 
differential equations take the form 


The geometrical meaning of these differential equations is 
that the osculating planes* of the extremals of our problem 
are orthogonal to the surface G=0. We call these curves 
geodesics of the surface. The shortest distance between two 
points on a surface, then, is necessarily given by an arc of a 
geodesic. 

EXAMPLE 

Show that the same geodesics are also obtained as the paths of a particle 
which is constrained to move on the given surface G = 0, subject to no 
external forces. (In this case the potential energy U vanishes and the 
reader may apply Hamilton’s principle (p. 512).) 

(b) Other Types of Subsidiary Conditions——In the problem 
discussed above we were able to eliminate the subsidiary con- 
dition by solving the equation determining the subsidiary 
condition and thus reducing the problem directly to the type 
discussed previously. With other kinds of subsidiary conditions 
which frequently occur, however, it is not possible to do this. 
The most important case of this type is the case of “ isoperi- 
metric” subsidiary conditions. The following is a typical 
example. 

With the previous boundary conditions and continuity con- 
ditions, the integral 

* Le. the planes containing the vectors (%, y, z) and (%, #, 2) (cf. Ex. 1, 2, 4, 
pp. 93-4). 
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I{g}=[ "Fee, ¢ de 


is to be made stationary, the argument function ¢(x) being 
subject to the further subsidiary condition 


A{¢}= a “Gla, ¢, ¢’)dz = a given constant ce. 
A particular case of this (F = ¢, G = »/(1 + ¢”)) is the classical 


isoperimetric problem. 

This type of problem cannot be attacked by our previous 
method of forming the “ varied ” function 6 = u + en by means 
of an arbitrary function 7(x) vanishing on the boundary only. 
For in general these functions do not satisfy the subsidiary 
condition in a neighbourhood of « = 0, except at «= 0. We can 
attain the desired result, however, by a method similar to that 
used in the original problem, by introducing, instead of one 
function 7 and one parameter ¢, two functions 7,(x) and 7,(z), 
which vanish on the boundary, and two parameters «, and «. 
Assuming that ¢= wu is the required function, we then form 
the varied function 


b= Ut ems + ee 


If we introduce this function into the two integrals, we obtain 
the following as a necessary condition for an extreme value or 
stationary character of the integral 


I =f Fo, U+ &yM + aM, UW + ey’ + pq’) da = O(a, €2), 
subject to the subsidiary condition 
H =f ‘Ge, U+- Ey + €gMe, Wt Eynr’ + aN’) da = ‘P(e, €2) = c: 


the function ®(e,, €,) is to be stationary for e, = 0, ¢g = 0, where 
€, € satisfy the subsidiary condition 


Fle, &) = . 


A simple discussion, based on the previous results for ordinary 
extreme values with subsidiary conditions, and in other respects 
following the same lines as the account given in § 2 (p. 498), 
then leads to this result: 
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Stationary character of the integral is equivalent to the existence 
of a constant multiplier A such that the equation H = c and Euler’s 
differential equation 


(By + Gy) — (Fy + G,) = 0 


are satisfied. An exception to this can only occur if the function 
u satisfies the equation 
d 


de yw G= 90. 


The details of the proof may be left to the reader, who may 
consult the literature on this subject.* 


EXAMPLES 
1. Use the method of Euler’s multiplier to prove that the solution 
of the classical isoperimetric problem is a circle. 


2. A thread of uniform density and given length is stretched between 
two points A and B. If gravity acts in the direction of the negative y-axis, 
the equilibrium position of the thread is that in which the centre of gravity 
has the lowest possible position. It is accordingly a question of making 


Hy 
an integral of the form is yV (1+ y?)da a minimum, subject to the sub- 


sidiary condition that L V(1-+- y*)dz has a given constant value. Show 
that the thread will hang i in a catenary. 


MiscetLanrous Exameies VII 


1. Show that the geodesics on a cylinder are helices. 
2. Find Euler’s equations in the following cases: 


(2) F=Vv : be y?) + yg(x), 
(6) F= a+ aap + vote )» 
(c) F=y?—y?+y, 

(4) F=vV/(1+y"). 


3. If there are two independent variables, find Euler’s equations in the 
following cases: 


* E.g. O. Bolza, Lectures on the Calculus of Variations (University of Chicago 
Press, 1904); G. A. Bliss, The Calculus of Variations (Open Court Publishing 
Company, Chicago, 1925). 
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(2) F = ap,? + 2bo.Py + Cp,? + od, 
(0) F = (gq + Qyy)® = (Ag), 
(c) F = (Ae)? + (@uePyy — P*ey)= 


4. Find Euler’s equations for the isoperimetric problem in which 
ey 
f (au'? 4 Qbuu! + cut) de 
Xo 


is to be stationary subject to the condition 


Eo 
i wde= 1. 
7) 


5. Let f(x) be a given function. The integral 
Xe) = f ‘S(a\o(a) dz 
is to be made a maximum subject to the integral condition 
H(9) = fgtde= 
(where K is a given constant). 


(a) Find the solution u(x) from Euler’s equation; 


(6) Prove by applying Schwarz’s inequality that the solution found 
in (a) gives the absolute maximum for J. 


ise (8912) 


CHAPTER VIII 


Functions of a Complex Variable 


In Chap. VIII, § 7 (p. 410) of Vol. I we touched on the theory 
of functions of a complex variable and saw that this theory 
throws new light on the structure of functions of a real variable. 
Here we shall give a brief but more systematic account of the 
elements of that theory. 


1. InTRODUCTION 


1. Limits and Infinite Series with Complex Terms. 


We start from the elementary concept of a complex num- 
ber z= @-+ ty (cf. Vol. I, p. 73) formed from the imaginary 
unit ¢ and any two real numbers a, y. We operate with these 
complex numbers just as we do with ordinary numbers, with the 
additional rule that 7? may always be replaced by —1. We re- 
present 2, the real part, and y, the imaginary part of z, by rect- 
angular co-ordinates in an zy-plane or a “ complex z-plane ”. 
The number Z = x — ty is called the complex number conjugate to 
z. If we introduce polar co-ordinates (r, 8) by means of the rela- 
tions z= r cos 6, y= rsin 0, 6 is called the argument (or ampli- 
tude) of the complex number and r = 4/(22 + y*) = V2z=|2| 
its absolute value (or modulus). 

We can immediately establish the so-called “triangle in- 
equality ” satisfied by the complex numbers z,, 2, and 2% + 2, 


la %|Slal+|el, 
and the further inequality 
} 4] — | me] S|] — ve], 


which follows immediately from it, if we put 2, = u, — Ug, 2 = Up. 
522 
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The “ triangle inequality ”’ may be interpreted geometrically as follows: 
we can represent the complex numbers 2,, 2, by vectors in the zy-plane 
with components 2, y, and 22, y, respectively. The vector which repre- 
sents the sum z, + z, is then simply obtained by vector addition of the 
two first vectors. The lengths of the sides of the triangle so formed are 
| 4: |, [22|, [#1 -+ |. Thus the “ triangle inequality ” merely expresses 
the fact that any one side of a triangle is less than the sum of the 
other two. 


The essentially new concept which we now have to con- 
sider is that of the limit of a sequence of complex numbers. We 
state the following definition: a sequence of complex numbers 
z, tends to a limit z provided |z,— | tends to zero. This 
of course means that the real part and the imaginary part of 
2, — 2 both tend to zero. Cauchy’s test applies: the necessary 
and sufficient condition for the existence of a limit z of a sequence 
2, is lim | 2, — %_ | = 0. 


n> wo 
m—>o 


A particularly important class of limits arises from infinite 
series with complex terms. We say that the infinite series with 
complex terms, 

a 
Ze, 


v=0 


converges and has the sum 8, if the sequence of partial sums 


S,= Xe, 


v=0 


tends to the limit S. If the real series with non-negative terms, 
a | ¢, F 
v=0 


converges, it follows, just as in Chap. VIII of Vol. I (p. 369), 
that the original series with complex terms also converges. The 
latter series is then said to be absolutely convergent. 

If the terms c, of the series, instead of being constants, 
depend on (2, y), the co-ordinates of a point varying in a region 
R, the concept of uniform convergence acquires a meaning. The 
series is said to be uniformly convergent in R if for an arbitra- 
rily small prescribed positive ¢ a fixed bound N can be found, 
depending on ¢ only, such that for every n = N the relation 
| S,— S |< holds, no matter where the point z = a + ty lies 
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in the region R. Uniform convergence of a sequence of complex 
functions S,,(z) depending on the point z of R may of course 
be defined in exactly the same way. All these relations and 
definitions and the associated proofs correspond exactly to those 
with which we are already familiar from the theory of real 
variables. 

The simplest example of a convergent series is the geometric 
series 


liz~4+2+24.... 


Just as in the case of the real variable, we have 


we see that the geometric series converges absolutely provided 
[z|<1, and also that the convergence is uniform pro- 
vided |z| <q, where q is any fixed positive number between 
0 and 1. In other words, the geometric series converges absolutely 
for all values of z within the unit circle and converges uniformly 
in every closed circle concentric with the unit circle and with a 
radius less than unity. 

For the investigation of convergence the principle of comparison 
is again available: if|c,| < p,, where 9, is real and non-negative, 

@ 


and if the infinite series & p, converges, then the complex series 
=0 


Xe, converges absolutely. 

If the p,’s are constants, while the c,’s depend on a point z 
varying in R, the series Zc, converges uniformly in the region in 
question. The proofs are word for word the same as the corre- 
sponding proofs for the real variable (Vol. I, Chap. VIII, p. 392) 
and therefore need not be repeated here. 

If M is an arbitrary positive constant and g a positive number 
between 0 and 1, the infinite series with the positive terms p, = Mq’ 


or vMq’-* or fal q’*1 also converge, as we know from Vol. I, 
v 


Chap. VIII, p. 401. We shall immediately make use of these 
expressions for purposes of comparison. 
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2. Power Series. 


The most important infinite series with complex terms are 
power series, in which ¢, is of the form ¢,= 4,2’; that is, a 
power series may be expressed in the form 


Pi) = 2az, 
y=O 
or, somewhat more generally, in the form 
La,(z— %)”. 
v= 


where 2, is a fixed point. As this form can, however, always be 
reduced to the preceding one by the substitution 2’ = z— %, 
we need only consider the case where 2) = 0. 

The main theorem on power series is word for word the same 
as the corresponding theorem for real power series in Chap. VIII 
of Vol. I (p. 399). If the power series converges for 2 == £, wt con- 
verges absolutely for every value of z such that|z|<|€&|. Further, 
if q is @ positive number less than 1, the series converges uniformly 
within the circle |z| Sq| € |. 

We can at once proceed to the following further theorem: 

The two series 


D2) = s va,z’-4 
yl 


4, gvtt 


ven 0 y+l1 


also converge absolutely and uniformly if | z| <q] é |. 

The proof follows exactly as before. Since the series P(z) 
converges for z= &, it follows that the n-th term, a,£", tends to 
zero as ” increases. Hence a positive constant M certainly exists 
such that the inequality | a,é*| << M holds for all values of n. 
If now |z|=q|&|, where 0<q <1, we have 


Ig) == 


M|é\| ght. 
n+l 


|a,,2"| <Mq", | Nae” | -<paiae M ng, |—~ gntl | Pegeian eS | 


rey"? 


We thus obtain comparison series which, as we have seen already 
(p. 524), converge absolutely. Our theorem is thus proved. 


naa 
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In the case of a power series there are two possibilities: either 
it converges for all values of z, or there are values 2 = y for which 
it diverges. Then by the theorem above the series must diverge 
for all values of z for which | z| > y (cf. Vol. I, p. 400), and, just 
as in the case of real power series, there is a radius of convergence 
p such that the series converges when | z| < p and diverges when 
|z] > p. The same applies to the two series D(z) and I(z), the 
value of p being the same as for the original series. The circle 
| 2| = p is called the circle of convergence of the power series. 
No general statements can be made about the convergence or 
divergence of the series on the circumference of the circle itself, 
ie. for | 2] = p. 


3. Differentiation and Integration of Power Series. 
It is natural to call an expression of the form 
S (2) = Gq + a2 + ag2? - + aye” 
with fixed (complex) coefficients a, a function of z, and more 
particularly a polynomial of the n-th degree in z. In the same 
way, a convergent power series 
ao 
P(2) = Daz” 
v=0 
is regarded as a function of the complex variable z in the interior 
of its circle of convergence. In that region it is the limit to 
which the polynomial 


P,@=tazv’ 
v=0 


tends as n tends to infinity. 

A polynomial f(z) may be differentiated with respect to the 
independent variable z in exactly the same way as for the real 
variable. In the first place we notice that the algebraic identity 


2%" =2 — znd n-2 n—1 
ar qr t-pan z+... te 
holds. If we now let z, tend to z*, we immediately have 
d 


* The concept of a limit for a continuous complex variable (2, > z) can be 
introduced in exactly the same way as for the real variable. 


VI] INTRODUCTION 527 


In the same way we immediately have 


P,!(2) = 4 P,(2) = Yim Pals) — Pa® _ ¥ a2 = D, le). 


2,2 4 2% v=l 
We naturally call the expression P,,'(z) the derivative of the com- 
plex polynomial P,,(z). 
We now have the following theorem, which is fundamental 


in the theory of power series: 
A convergent power series 


P(2) = az” 
vex Q 


may be differentiated term by term in the interior of tts circle of 
convergence. That 1s, the limit 
P'(z) = lim P(2) — P(2) 
2)—>s 4% 
exists, and 


P'(z) = & va,z-4 = lim P,'(z) = lim D,(2) = D(z). 
vol no n—>@o 


From this theorem it is at once clear that the power series 


I) =S vt 
y=ov + 1 


may be regarded as the indefinite integral of the first power series, 
ie. that I'(z) = P(z). 
The term-by-term differentiability of the power series is 


proved in the following way: 
From p. 526 we know that the relation D(z) = lim D,(2) 
n—>o 
holds within the circle of convergence. We have to prove that 
the absolute value of the difference quotient Pe)" differs 


from D(z) by less than a prescribed positive number ¢, if only we 
take z, sufficiently close to 2 within the circle of convergence, 
For this purpose we form the difference quotient 


Pl) — P(e) _ Pals) — Pal) 5 F gn, 
4 % co poon+1 te 


D(z, 2) = 
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where for brevity we write 
2 z,” — 2” 
4, ae 
If we keep to the notation used on p. 525, and if |z|<q| ¢| 
and also | 4,|<q| é|, then it is certain that 


[A tse t ep. 


= 2 tt greet tp 2, 


Hence 


| 2, |= a 


a co} 
< & jalyt|eprs 3 gn. 
vant] | é | pn) 
Owing to the convergence of the series of positive terms Zvq’-, 
the expression | R,, | can therefore be made as small as we please, 
provided we make » sufficiently large. We choose n so large that 
this expression is less than ¢/3, and also so large—increasing n 
further if necessary—that | D(z) — D,(z)|<«/3. We now 


choose 2, so close to 2 that the absolute value of Pales) — Pal) 
also differs from D,(z) by less than «/3. Then aa 


C+) 
= 4,A, 
vont 


| Dey 2) — De) | |= =F) — yo 
+| Dil) — D@)| +2, 
Sg sg > © 


and this inequality expresses the fact asserted. 

Since the derivative of the function is again a power series 
with the same radius of convergence, we can differentiate again 
and repeat the process as often as we like. That is, a power 
series can be differentiated as often as we please in the interior of its 
circle of convergence. 

Power series are the Taylor series of the functions P(z) which 
they represent: that is, the coefficients a, may be expressed by the 
formula 


a, = 1 Porgy. 
ri 


The proof is word for word the same as for the real variable 
(cf. Vol. I, p. 404). 
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4. Examples of Power Series. 


As we mentioned in Chap. VIII, § 7 (p. 418) of Vol. I, the power series 
for the elementary functions can immediately be extended to the complex 
variable; in other words, we can regard the power series for the elementary 
functions as complex power series and extend the definitions of these 
functions to the complex realm in this way. For example, the series 

O gy © 2 OO (__]\rg2r+1 ) Qu ro) Qv+1 
Soe PGi 8 ae. ee 

v9 VE y= (2v)P ,20 (2¥+ A)” p20 (2v)! ,=0(2¥ + 1) 
converge for all values of z. (This follows at once from comparison tests.) 
The functions represented by these power series are again denoted re- 
spectively by the symbols e, cosz, sinz, coshz, sinhz, just as in the real 
case. The relations 

cosz + ising = e*, 


coshz = cosiz, isinhz = siniz 


now follow immediately from the power series. Again, by differentiating 
term by term we obtain the relation 


d 


— e% = e3, 
dz 


As examples of power series with a finite radius of convergence, 
other than the geometric series, we consider the series 


log(1 +2) = B(—ayt1 = 


v= 


© gat 1 
aro tanz = X (—1)’ = 3, log ( + iz) — log(1 — #z)}, 


y=: 0 2v+ 1 


whose sums we again denote by the symbols log, arc tan. Here the radius 
of convergence is again 1. Differentiating term by term, we have 


dlog(l+z) 1 _ 1 
de 7 Te ie (are tanz) = i; 
EXAMPLES 
1, For which points z= x + ty is 
z—-1 <1? 
z+1 


2. Prove that if Xa," is absolutely convergent for z= , then it is 
uniformly convergent for every z such that | z| S| C|. 


3. Using the power series for cosz and sinz, show that 


cos*z + sin?z = 1, 
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4*, For what values of z is 


ye=k 1-2 


convergent? 


2. FouNDATIONS oF THE THEORY oF FUNCTIONS OF 
A CompLEX VARIABLE 


1. The Postulate of Differentiability. 


As we have seen above, all functions which are represented 
by power series possess a derivative and an indefinite integral. 
This fact may be made the starting-point for the general theory 
of functions of a complex variable. The object of such a theory 
is to extend the differential and integral calculus to functions of 
a complex variable. In particular, it is important that the con- 
cept of function should be generalized for complex independent 
variables in such a way that the function is differentiable in the 
complex region. 

We could, of course, confine ourselves from the very beginning 
to the consideration of functions which are represented by power 
series and thus satisfy the postulate of differentiability. There 
are, however, two objections to this procedure. In the first place, 
we cannot tell @ priort whether the postulate of the differen- 
tiability of a complex function does necessarily imply that the 
function can be expanded in a power series. (In the case of the 
real variable we saw that functions even exist which possess 
derivatives of any order and yet cannot be expanded in a power 
series (ci. Vol. I, p. 335).) In the second place, we learn even from 
the case of the simple function 1/(1 — z), whose power series, 
the geometric series, converges in the unit circle only, that even 
for simple functional expressions the power series does not 
represent the whole behaviour of the function, which in this 
particular case we already know in other ways. 

These difficulties can, it is true, be avoided by a method due 
to Weierstrass, and the theory of functions of a complex variable 
can actually be developed on the basis of the theory of power 
series. It is desirable, however, to emphasize another point of 
view, which is due to Cauchy and Riemann. In their method, 
functions are characterized not by explicit expressions but by 
simple properties. More precisely, the postulate that a function 
shall be differentiable, and not that it shall be capable of being 
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represented by a power series, is to be used to mark out the region 
in which a function is defined. 

We could start a priori from the following general concept 
of a complex function £ = f(z) of the complex variable z. If R 
is a region of the z-plane and if with every point z= 2-+ yin 
R we associate a complex number = «+ tw by means of any 
relation, ¢ is said to be a complex function of z in R. This 
definition, therefore, would merely express the fact that every pair 
of real numbers 2, y, such that the point (z, y) lies in R, has a 
corresponding pair of real numbers 4%, v; i.e. that u and v are any 
two real functions u(x, y) and (2, y), defined in R, of the two 
real variables x and y. 

This concept of function, however, would be much too wide. 
We limit it in the first place by the condition that u(z, y) and 
v(x, y) must be continuous functions in R with continuous first 
derivatives U,, Uy, Vg, Vy. Further, we insist that our expression 
utiw= C =f(z)=f(e+ ty) shall be differentiable in R with 
respect to the complex independent variable 2; that is, the limit 


lim f(a) —f@ _ lim f+ h) — f@ _ prey 


no2 &—% h—>0 h 
shall exist for all values of z in R. This limit is then called the 
derivative of f(z). 


In order that the function may be differentiable it is by no 
means sufficient that ~ and v should possess continuous deriva- 
tives with respect to x and y. Our postulate of differentiability 
implies far more than differentiability in the real region, for 
h=r-+ is can tend to zero through both real values (s = 0) 
and purely imaginary values (r = 0) or in any other way, and 
the same limit f’(z) must result in all cases, if the function is to 
be differentiable. 

If, for example, we put u= z, v= 0, that is, f(z) = f(x + ty) = 2, 
we should have a correspondence in which u(x, y) and v(x, y) are con- 
tinuously differentiable. For the derivative, however, by putting h=r 
we obtain 


tin FEA NH IO Ly FE a, 
r—>o r r—>0 r 
whereas if we put h = is we have 
im {2+ )—S® _ im 90, 
a 


s—>0 bad —>0 8 
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that is, we obtain two entirely different limits, For € = u-+ iy = 2 -+ 2iy 
we similarly obtain different limits for the difference quotient as h tends 
to zero in different ways. 


Thus in order to ensure the differentiability of f(z) we have 
to impose yet another restriction. This fundamental fact in the 
theory of functions of a complex variable is expressed by the 
following theorem: 

If {= u(x, y) + iv(x, y) = f(z) = f(x + iy), where u(x, y) 
and v(x, y) are continuously differentiable, the necessary and suffi- 
cient conditions that the function f(z) shall be differentiable in the 
complex region are 

Uy = Vy, Uy = Vey 


the so-called Cauchy-Riemann differential equations. 

In every region R where u and v satisfy these conditions f(z) 
as said to be an analytic * function of the complex variable z, and 
the derivative of {(z) is given by 


SQ = Ue + Wy = Vy — ty, = : (uy + 4,). 


We shall first show that the Cauchy-Riemann differentia] 
equations form a necessary condition. If we accordingly assume 
that f(z) exists, we must obtain the limit f’(z) by taking h equal 
to a real quantity r. That is, 


f'@ = lim “2 LAY) Hey), ete y) — oe y) 
r—>0 r ¥ 


= Up + Wy. 


In the same way, we must obtain f’(z) if we take h to be a pure 
imaginary 7s, that is, we must have 


f'2) = lim u(x, y + s) — u(x, ¥) m ptmyt 8) — u(z, y) 
s—>0 ws 


as 


1 ; 
= i (Uy + Wy). 
Hence 


Ug + ity = : (uy + i,). 


* The term regular is also used. 


f 
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By equating real and imaginary parts we at once obtain the 
Cauchy-Riemann equations. 

These equations, however, also form a sufficient condition. 
for the differentiability of the function f(z). To prove this, we 
form the difference quotient 


f(@@+h)—f@ 


h 
_uctry +s) —~ uz, y) + if{o(e + 7, y +8) — ole, y)} 
r+ 
tle Sty + iv, + isvy + <, | h| + | | 
r +s : 


where «, and e, are two real quantities which tend to zero 
with | h| = /(r? + s?). If now the Cauchy-Riemann equations 
hold, the above expression immediately becomes 
| h| | *| 

Ug + We + oars mere 
We see at once that as h-> 0 this expression tends to the limit 
u, + %,, and that independently of the way in which the passage 
to the limit 4 > 0 is carried out. 

We now use the Cauchy-Riemann equations, or the property 
of differentiability which is equivalent to them, as the definition 
of an analytic function, on which we shall base our deduction of 
all the properties of such functions. 


2. The Simplest Operations of the Differential Calculus. 


All polynomials, and all power series in the interior of their 
circle of convergence, are analytic functions, by § 1 (p. 527). 
We see at once that the operations which lead to the elementary 
rules of the differential calculus can be carried out in exactly the 
same way as for the real variable. In particular, the following 
rules hold: the sum, the difference, the product, and (provided 
the denominator does not vanish) the quotient of analytic func- 
tions can be differentiated according to the elementary rules of 
the calculus, and hence are again analytic functions. Further, 
an analytic function of an analytic function can be differentiated 
according to the chain rule and therefore is itself an analytic 
function. 
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We also note the following theorem: if the derivative of an 
analytic function ¢ = f(z) vanishes everywhere in a region R, 
the function is a constant. 

Proof.—We have uz, — iu,=0 everywhere in R. Hence 
Uz = 0, uy = 0, and in virtue of the Cauchy-Riemann equations 
V_ = 0, v, = 0; that is, w and v are constants; hence ¢ is a 
constant. 

Application to the Exponential Function.—We use this theo- 
rem to define the exponential function, oe we have already 


defined by means of the power series e* = = 2”/v!, by means of 


its differential property, in the complex pe also: 
If a complex function f(z) satisfies the differential equation 


f'@=f@, 


then {(z) = ce”, where c is a constant. 

Proof.—As we see at once by differentiating the power series 
(which converges everywhere) term by term, the exponential 
function certainly satisfies the condition. If g(z) is another 
function for which g’(z)= g(z), it immediately follows that 
F(2)9'(2) — 9(2) f'(2) = 0 everywhere in RB. We are entitled to 
assume that g(z) is not zero at any point, as otherwise our relation 
would be satisfied at that point by f(z) = 0, or c = 0, which 
gives f(z) = 0 everywhere. Then the equation (fg’ — f’g)/9? = 0 
means that the derivative of the quotient f/g vanishes, i.e. that 
flg is constant, which is what we asserted. 

From this follows the functional equation of the exponential 
function, 

ee™ = EF ts, 


(On the basis of the power series definition this functional equa- 
tion is by no means a trivial assertion.) We obtain it by con- 
sidering the function g(z) = e*+*, where ~ is fixed. By the chain 
rule, g(z) satisfies the differential equation g‘(z) = g(z). Hence 
by the above theorem g(z) = ce*. To determine ¢ we put z= 0 
and bear in mind that according to the power series definition 
e° = j. Thus we at once have g(0) = e = ¢, and the functional 
equation follows. 

In § 3 (p. 542) we shall develop a more satisfactory method 
for discussing the exponential function independently of the 
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power series. Here we merely mention that in particular for 
2 ©, % = ly 
ettiv — ereiv — e%(cosy + 1siny). 


It follows further that the exponential function can never vanish, 
for if e* vanished, then e* = e*e*—*: would vanish for all values 
of z, which is certainly not the case. 
Making use of the facts that cos27 = 1 and sin2a = 0, we 
immediately have 
et 1, 


The exponential function therefore satisfies the equation 
Gas ot tnt, 


that is, it is periodic with period 271. 


EXAMPLE 


Prove that the product and the quotient of analytic functions and the 
function of an analytic function are again analytic, using not the property 
of differentiability but the Cauchy-Riemann differential equations. 


3. Conformal Representation. Inverse Functions. 


By means of the functions u(z, y) and o(2, y) the points of the 
z-plane or zy-plane are made to correspond to points of the 
¢-plane or wv-plane. Thus we have a transformation or mapping 
(Chap. III, § 3, p. 133) of regions of the xzy-plane on to regions 
of the wo-plane. The Jacobian of the transformation is 


O(u, v) 
D= 
o(z, y) 


The Jacobian is therefore different from zero, and is in fact posi- 
tive, wherever f’(z)+-0. If we assume that f"(z) = 0, our previous 
results (Chap. III, § 3, p. 152) show that a neighbourhood of the 
point 2, in the z-plane, if sufficiently small, is mapped uniquely, 
reversibly, and continuously on a region of the ¢-plane in the 
neighbourhood of the point ¢y) = f(%). This mapping is conformal, 
i.e. angles are unchanged by it. For, as we have seen in Chap. IIT, 
p. 166, the Cauchy-Riemann equations are the necessary and 
sufficient conditions that the transformation may be conformal, 
not only the magnitude but also the sign of angles being pre- 
served. We thus have the following result: 


= UV, — Ue = Ug? + yf = | f’(2) |?. 
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Conformality of the transformation given by u(x, y) and v(x, y) 
and analytic character of the function f(z) = u + iv mean exactly 
the same thing, provided we avoid points z» for which {'(zo) = 0. 

The reader should study the examples of conformal representation 


discussed in Chap. III, § 3, p. 136, and prove that all these transformations 
can be expressed by analytic functions of simple form. 


Since in the case of a unique reversible conformal represen- 
tation of a neighbourhood of z on a neighbourhood of %, the 
reverse transformation is also conformal, it follows that z= x ++ ty 
may also be regarded as an analytic function ¢({) of £ = w+ w. 
This function is called the inverse of £ = f(z). 

Instead of using our geometrical argument, we can at once 
establish the analytic character of this inverse by calculating the 
derivatives of x(u, v), y(u, v) as on p. 143. We have 


Ly bane D Ly D 2 Yu D’ y. c 7 D > 
and we see that the Cauchy-Riemann equations 2, == Y,, Ty= —Yy 


are satisfied by the inverse function. As we can at once verify, 
the derivative of the inverse z= ¢(¢) of the function £ = f(z) 
is given by the formula 


de dt 
didz 
ExAMPLeEs 


1. Find where the following functions are continuous: 
ps z+2 2+ 
a) & 6 a\s iC) = a, 
( () j2\ ey «® Th 
2. Which of the functions in Ex. 1 are also differentiable? 
3*. Prove that a substitution of the form 


ta 2 +B 
Bz + a 
where « and @ are any complex numbers satisfying the relation 
ae— BB=1, 


transforms the circumference of the unit circle into itself and the interior 
of the circle into itself. Prove also that if 


6B — ac = 1, 
the interior is transformed into the exterior. 
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4, Prove that in the transformation ¢ = 4(z + 1/z) the circles with 
centres at the origin and the straight lines through the origin of the z-plane 
are respectively transformed mto confocal ellipses and hyperbolas in the 
¢-plane (cf. Ex. 5, p. 158). 


5. Prove that a substitution ¢ = cl E leaves the cross ratio 
an / *1 4 of four points 2, 22, 23, 2, unaltered. 
By — &y ff % — 2 

6*. Prove that any circle may be transformed by a substitution of the 


form ¢ = % +8 into the upper half-plane bounded by the real axis. (Use 
yz 


+8 
Ex. 1, p. 529.) 
7. Prove the following property of the general linear transformation 
t= az+b 
ca a 


where a, b, ¢, d are constants and ad — be + 0: 

All circles and straight lines in the z-plane are transformed by this 
relation into all straight lines and circles in the ¢-plane. 

If the z-plane and the ¢-plane are imagined to coincide, the points z 
for which ¢ = z are called fixed points. In general there are two different 
fixed points. Show that in this case the family of circles through the two 
fixed points and the family of circles orthogonal to them transform into 
themselves. ‘ 

8. The inverse of the power function ¢ = 2" is unique in the neighbour- 
hood of every point 2), provided z= 0, for then the derivative ner 
does not vanish. The point z = 0, where the derivative vanishes, however, 
forms an exception; hence the many-valuedness of the function Vs 
We shall discuss these relations more closely in § 6, p. 563. 


3. Tae INTEGRATION oF ANALYTIC FUNCTIONS 


1. Definition of the Integral. 


The central fact of the differential and integral calculus of 
functions of a real variable is expressed in the theorem that 
the integral of a function (the upper limit being undetermined) 
may be regarded as the primitive function or “ indefinite integral ” 
of the original function (Vol. I, p. 109). A corresponding relation 
forms the nucleus of the theory of analytic functions of a com- 
plex variable. 

We begin by extending the definition of the definite integral 
of a given function f(z). Here it is convenient to use ¢= 7 + as 
instead of the independent variahle z, as we shall use ¢ to denote 
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the variable of integration. Let the function f(t) be analytic in 

a region R, and Jet t = f) and t = z be two points in this region, 

joined by an oriented curve C which is piecewise smooth and lies 

wholly within R (fig. 1). We then subdivide the curve C into n 
portions by means of the succes- 
sive points f, 4,...,¢, = 2 and 
form the sum 


8, = % FN, me t,1); 


where ¢,’ denotes any point lying 

on C between ¢,_, and 4. If 

we now make the subdivision 

finer and finer by letting the 

Fig. 1 number of points increase with- 

out limit in such a way that the 

greatest of the intervals | t, — ¢,_,]| tends to zero, S,, tends to 

a limit which is independent of the choice of the particular inter- 
mediate point ¢,’ and of the points ¢,. 

This can be proved directly by a method analogous to that 
used to prove the corresponding theorem of the existence of the 
definite integral for real variables. For our purpose, however, 
it is more convenient to reduce the theorem to what we already 
know about real curvilinear integrals (cf. Chap. V, § 1, p. 344), 
as follows. We put f(t)= u(r, s)+ iv(r, 8), L=7,+ 4s, 
i,’ = 17,’ + ts,’, At, = t, —t,_, = Ar, + 7As,. Then we have 


n 
S, = = u(r,’, s,') Ar, = u(r,’ 8,’) As, 
v=1 


+4 z o(r,’, s,)Ar, + ulr,’, s)As,}. 
yx] 


As n increases the sums on the right-hand side tend to the real 
curvilinear integrals i (udz — vdy) and + f (vda + udy) respec- 
o o 


tively, and hence, as we asserted, S, tends to a limit. We call 
this limit the definite integral of the function f(¢) along the curve 
C from f to 2, and write it 


j fd or i S(O) dt. 
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Thus [foa= funda —ody) + if (ode + udy). 


The definition of this definite integral (cf. Chap. V, § 1, p. 349) 
at once gives the following important estimate: f \{(t)| <M 
on the path of integration, where M is a constant and L is the 
length of the path of integration, then 


[ freee | < ML. 


In addition we may point out that operations with complex 
integrals (in particular, combination of different paths of in- 
tegration) satisfy all the rules stated in this connexion for curvi- 
linear integrals in Chap. V, § 1, p. 347-9. 


2. Cauchy’s Theorem. 


The essential fact of the theory of functions of a complex 
variable is that the integral between t) and z is largely indepen- 
dent of the choice of the path of integration CO. In fact, we have 
Cauchy’s theorem: 

If the function f(t) is analytic in a simply-connected region R, 
the integral 


f F(jde=f fae 


is independent of the particular choice of the path of integration 
C joining ty and z in R; the integral is an analytic function F(z) 


such that 
4 re) = 2( [F0a) =S0. 


F(z) is accordingly a primitive function or indefinite integral 
of {(z). 

Cauchy’s theorem may also be expressed as follows: 

If subject to the above assumptions we take the integral of £(t) 
round a closed curve lying in a simply-connected region, the integral 
has the value zero. 

The proof that the integral is independent of the path follows 
immediately from the main theorem on curvilinear integrals 
(cf. Chap. V, § 1, p. 353); for both udx — ody, the integrand in 
the real part, and vdx + udy, the integrand in the imaginary 
part, satisfy the condition of integrability, in virtue of the Cauchy- 
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Riemann equations (p. 532). Thus the integral is a function of 
Ly or 2+ yw=2, F(z)= U2, y) + iV(a, y), and from our 
previous results for curvilinear integrals we have the relations 


U,=4u, U,=—v, Vz=%, Vy=4, 
that is, 
U,= V, U,=—V,, Us+iV,z=ut w, 


which shows that F(z) is actually an analytic function in R with 
the derivative F’(z) = f(z). 

The assumption that the region is simply-connected is essential 
for the validity of Cauchy’s theorem. 


For example, we may consider the function 1/, which is analytic every- 
where in the ¢-plane except at the origin. We are, however, not entitled 
to conclude from Cauchy’s theorem that the integral of 1/t, taken round 
a closed curve enclosing the origin, vanishes. For this curve cannot be 
enclosed in a simply-connected region in which the function is analytic. 
The simple connectivity of the region is destroyed by the exceptional 
point ¢ = 0. If we take the integral e.g. round a circle K given by |t|=r 
or ¢ = re’® in the positive sense, and make @ the variable of integration 
(dt = rie®d®), we have 


dt 2a gyeif 7 


that is, the value of the integral is not zero but 277i. 


We can, however, extend Cauchy’s theorem to multiply- 
connected regions as follows: 

If a multiply-connected region 

R is bounded by a finite number 

of sectionally smooth closed curves 

Cy, Cy,..., and if f(z) is analytic 

in the interior of this region and 

also on its boundary,* then the 

sum of the integrals of the function 

elo Sy, along all the boundary curves is 

zero, provided that all the boundaries 

are described in the same sense relative to the interior of the region R, 

te. that the region R is always on the same side, say the left-hand 

side, of the curve as i 1s described. 
The proof follows at once, on the model of the corresponding 


* A function is said to be analytic on a curve if it is analytic throughout 
a neighbourhood, no matter how small, of this curve. 
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proofs for curvilinear integrals; we cut up the region R into a 
finite number of simply-connected regions (figs. 2, 3), apply 
Cauchy’s theorem to these regions separately, and add the results. 


Fig. 3.—A multipiy-connected region R subdivided by Q1, Op --- into 
simply-connected regions 


We can express this theorem in a somewhat different way: 

If the region R is formed from the interior of a closed curve C 
by cutting out of this interior the interiors of further curves Cy, 
Cy...» then 


[fouaZf fede 


where the integrals round the external boundary C and the internal 
boundaries are to be taken in the same sense. 


3. Applications. The Logarithm, the Exponential Function, and 
the General Power Function. 


We can now use Cauchy’s theorem as the basis for a satis- 
factory theory of the logarithm, the exponential function, and 
hence of the other elementary functions, following a procedure 
similar to that adopted for the real variable (Vol. I, Chap. Ti, 
§ 6, p. 167). 

We begin by defining the logarithm as the integral of the 
function 1/t. At first we limit the path of integration by making 
it lie in a simply-connected region, making a cut along the nega- 
tive real a-axis, that is, permitting no path of integration which 
crosses the negative real axis. More precisely: if we put 
t= |t| (cosd + ¢sin Q), we limit @ by the inequality —7 <9S7. 
In the ¢-plane, after the cut has been made, we join the point 
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t= 1 to an arbitrary point z by any curve C, and we can then 
use Cauchy’s theorem to integrate the function 1/¢ between these 
two points, independently of the path. The result is an analytic 
function, which we call logz: 


f= loge =F =f. 


The logarithm has the property that 


Fig. 4.— log z = log | z{ + 16 


inverse function of the logarithm, z= 9(f). We have 9(0) = 1, 
and by the formula for the derivative of the inverse 


9(5) = Uf") = z= 9(6). 


By § 2, p. 536, the inverse is thus determined uniquely and is 
identical with the exponential function defined previously: 
g(t) = &. 

The function f(z) = logz is uniquely determined, except for 
an additive constant, by its differentiation property f’(z) = 1/z. 
For if there were another function 9(z) with this property, their 
difference would have the derivative zero and would therefore be 
constant. Since the function g(z) = f(az) = log (az) satisfies the 
condition g'(z) = af'(az) = a/az = 1/z, by the chain rule, we have 
log (az) = g(z) = ¢ + logz, where c is a constant independent of 
z. Its value is determined by putting z= 1, ie. loge = 0, and 
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we thus have log(a) =. This gives the addition theorem for the 
logarithm, 
log (az) = loga + logz. 


loge = [°F 


is easily evaluated explicitly by taking the straight line joining 
the points t= 1 and ¢=|z| together with the circular arc 
|¢|=|z|as the path of integration. We have 


The integral 


logz = log| z|+ 7, 


where @ is the argument of the complex number z (fig. 4). 
The value obtained in this way for the logarithm of any com- 
plex number z, whose argument lies in the interval —7 < 6s7n, 


Zz 


Fig. 5.— log z = log|z| + £6 + 28 


is often called the principal value of the logarithm. This termino- 
logy is justified by the fact that other values of the logarithm 
can be obtained by removing the condition that the negative 
real axis must not be crossed. We can then join the point 1 to 
the point z by a point which encloses the origin t = 0. On this 
curve the argument of ¢ will increase up to a value which is 
greater or less than the argument previously assigned to z by 
2a. We then have the value 


logz = log | z|-+ 10 + 2m 


for the integral (fig. 5). In the same way, by making 
the curve travel round the origin in one direction or the 
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other any integral number of times n, we obtain the value 
loge = log | z| + 70 +- 2nai. 


This expresses the many-valuedness of the logarithm. 

In the case of the exponential function this many-valuedness 
is exhibited in the equation e = 1. For the same value of z 
corresponds to all the different values {= logz, which differ 
only by multiples of 277. In the inverse of the logarithm, i.e. the 
exponential function, the addition or subtraction of 28 to or 
from the argument must not alter the value of the function: 
(C+ 2ri) = d(Z), or St°M= ec. Tf C=0, we have the 
equation ¢?" = 1, 

If we now introduce the trigonometric functions sinz and cosz 
by means of the equation 


e# = cosz-- ¢ sing, 


which we now may take as their definition, we see at once that 
these functions have the period 27. Thus we have deduced 
the periodic character of the trigonometric functions without 
reference to their elementary geometrical definitions. 

Now that we have introduced the logarithm and the expo- 
nential function it is easy to introduce the general power functions 
a* and 2*, where a and @ are constants (cf. the corresponding 
discussion for the real variable in Vol. I (Chap. ITT, § 6, p. 173) ). 
We define a* by the relation 


a?= e loga | 


where the principal value of loga is to be taken. In the same way 
we define z* by the relation 


2g — e logz 


While the function a* is defined uniquely if we use the princi- 
pal value of loga in the definition, the many-valuedness of the 
function 2* goes deeper. Taking the many-valuedness of logz 
into account, we see that along with any one value of 2* we also 
have all the other values which are obtained by multiplying one 
value by e*"™, where is any positive or negative integer. If 
a is rational, say a = p/g, where p and g are integers prime to 
one another, among these multipliers there are only a finite 
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number of different values (whose g-th power must be unity). 
Tf, however, a is irrational, we obtain an infinite number of 
different multipliers. The many-valuedness of the function 2 
will be discussed in greater detail in § 6 (p. 563). 

As we see from the chain rule, these functions satisfy the 
differentiation formule 


d(a*) — 7ft d(z*) ms a—1 
a a’ loga, ae az". 
ExamMPies 


1. (The gamma function.) Prove that the integral 
Tee] 
Te) = f lett 
0 


(where the principal value of ‘#—1! is taken), extended over all real values 
of the variable of integration ¢, is an analytic function of the parameter 
z= 2-+ ty, if >. (Show directly that the expression I'(z) can be 
differentiated with respect to z.) Prove that the gamma function thus 
defined for the complex variable satisfies the functional equation 
F(z + 1) = az). 

2*, (Riemann’s zeta function.) Taking the principal value of n*, form 
the infinite series 

hala S 
wera 0) 

Prove that this series converges if x > 1 and represents a differentiable 
function (¢(z) is called Riemann’s zeta function). The proof can be carried 
out directly by a method like that for power series (cf. Vol. I, p. 382). 


4. Cavcnuy’s FoRMULA AND ITs APPLICATIONS 


1. Cauehy’s Formula. 


Cauchy’s theorem for multiply-connected regions leads to a 
fundamental formula, again due to Cauchy, which expresses the 
value of an analytic function f(z) at any point z= a in the 
interior of a closed region R, throughout which the function is 
analytic, by means of the values which the function takes on the 
boundary C. 

We assume that the function f(z) is analytic in the simply- 
connected region & and on its boundary C. Then the function 

_ J) 
92) = ray 
19 (#912) 
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is analytic everywhere in the region R, the boundary C included, 
except at the point z= a. Out of the region R we cut a circle 
of small radius p about the point z = a, lying entirely within R 
(fig. 6), and then apply Cauchy’s theorem (p. 541) to the function 
g(z). If K denotes the circum- 
ference of the circle described 
in the positive sense and C the 
boundary of FR described in the 
positive sense, Cauchy’s theorem 
states that 


i g(z)dz = f g(z) dz. 


On the circle K we have 
z=a-+ pe, where the angle 6 determines the position of the 
point on the circumference. On the circle, therefore, dz = pie” d8, 
and hence 


Fig. 6 


Qn , 
i g(z)dz = i f flat pe) dd. 


Since f(z) is continuous at the point a, we have, provided p is 
sufficiently small, 


S(a+ pe") =f(a) +, 


where | 7 | is less than an arbitrary prescribed positive quantity e. 
Hence 


2a ; ar Qa 

f fat pe) de —f fade | = J 0 | < Ine, 
0 0 0 

and therefore 


2r 
[ fa + pet) d0 = 2nf(a) + «, 
where | «|S 27e. Thus if p is sufficiently small 
J[[g@)dz = 2mif(a) + xi, 


where | «i |<. 
If we make ¢« tend to zero (by making p tend to zero), 
the right-hand side of the equation tends to 27if(a), while 


the value of the left-hand side, namely, f g(z)dz, is unaltered. 
0 
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We thus obtain Cauchy’s fundamental integral formula 


_ 1 7 fe 
ae 


If we now revert to the use of ¢ as variable of integration and 
then replace a by z, the formula takes the form 


f= = f LO a. 


714 4ot — 2 


This formula expresses the values of a function in the interior 
of a closed region in which the function is analytic by means 
of the values which the function takes on the boundary of the 
region. 

If in particular C is a circle t= z-+ re with centre z, that is, if 
di = ire!°d0, then 


2 
f= = f Fle + rei*)a0, 


In words: the value of a function at the centre of a circle is equal to the mean 
of tts values on the circumference, provided that the closed area of the circle 
is a region in which the function is analytic. 


2. Expansion of Analytic Functions in Power Series. 


Cauchy’s formula has a number of important theoretical 
applications, the chief of which is the proof of the fact that 
every analytic function can be expanded in a power series, which 
thus connects the present theory with that given in § 1 (p. 527). 
More precisely, we have the following theorem: if the function 
f(z) ts analytic in the interior and on the boundary of a circle 
|z—2| SR, i can be expanded as a power series in z— 24 
which converges in the interior of that circle. 

In proving this we can take z)=0 without loss of generality. 
(Otherwise we should merely have to introduce a new indepen- 
dent variable z’ by means of the transformation z— z = 2’.) 
We now apply Cauchy’s integral formula to the circle C, | z| = RB. 
and write the integrand (using the geometric series) in the form 


POI? 2 IO ly et a+. 2) Hen it 
—Z 


t—2z t l—2zjt t 


Since z is a point in the interior of the circle, | z/t | = q is a positive 
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1 gntl 1 
tt y— ait 

of the geometric series, we obviously have the estimate 
1 

l—q 


number less than unity, and for r,= the remainder 


| tn | <ir 


Introducing our expressions into Cauchy’s formula and integrat- 
ing term by term, we obtain 


FO=qtazt... bee + R,, 


where 


1 
= 5 [FOrt. 


If M is an upper bound of the values of | f(t)] on the circum- 
ference of the circle, our estimation formula for complex integrals 
(cf. § 3, p. 5389) immediately gives 

grt 
1—q 


for the remainder. Since q is a proper fraction this remainder tends 
to zero as ” increases, and for f(z) we obtain the power series 


~o 
L@= 26, 
v=0 


M 


|B Spy fm 7RM = 
q 


where 


Our assertion is thus proved. 

This theorem has important results. To begin with, we know 
from § 1 (p. 528) that every power series can be differentiated as 
often as we please in the interior of its circle of convergence. 
Since every analytic function can be represented by a power 
series, it follows that the derivative of a function in the interior 
of a region where the function is analytic is also differentiable, 
Le. is again an analytic function. In other words, the operation 
of differentiation does not lead us out of the class of analytic func- 
tions. As we already know that the same is true for the operation 
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of integration, we see that differentiation and integration of 
analytic functions can be carried out without any restrictions. This 
is an agreeable state of affairs, which does not exist in the case 
of real functions. 

Since, as we saw in § 1, p. 528, every power series is the 
Taylor series of the function which it represents, it now follows 
in general that every analytic function can be expanded in the 
neighbourhood of a point z = 2 in a region R where the function 
is analytic in a Taylor series 


Fe) = flee) +E PO (ey 


the coefficients c, above are — given by the formulz 


_ £0) _ LO xy 
a salon 


vl pad 


¢,= 


From our result we may also deduce an important fact about 
the radius of convergence of a power series. The Taylor series 
of a function f(z) in the neighbourhood of a point 2 = 2, certainly 
converges in the interior of the largest circle whose interior lies 
wholly within the region where the function is defined and is 
analytic. 

In virtue of the theorems on differentiation and integration 
which we have now established as valid for the complex variable 
also, all the elementary functions which we expanded in Taylor 
series for the real variable have exactly the same Taylor series 
for the complex variable. For most of these functions we have 
already seen that this is true. 


Here we may point out that e.g. the binomial series 


(l+2¢= = (CG) 


v= 


is also valid for the complex variable if | z | < 1, provided that 
(1 + 24 = erlog(i+2) 


is formed from the principal value of log(I +- 2). 

The fact that the radius of convergence of this series is equal to unity 
follows from what we have just said, together with the remark that the 
function (1 + z)* is no longer analytic at the point z= —1. For if it were, 
all the derivatives must exist there, which is certainly not the case. The 
circle with radius 1 with the point z = 0 as centre is therefore the largest 
circle in the interior of which the function is still analytic. 
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As we have already pointed out in Chap. VIII of Vol. I (p. 414), 
the behaviour of power series as regards convergence only 
becomes completely intelligible in the light of the fact which 
we have just proved about the radius of convergence. 


For example, the failure of the geometric series representing 1/(1 -+ 2*) 
to converge on the unit circle is a simple consequence of the fact 


that the function is no longer analytic for z= +4 and z= —#. We also 
see now that the power series 
z Bz” 
= = La 
e—1 ve’ 


which defines Bernoulli’s numbers (cf. Vol. I, Chap. VIII, Appendix, p. 422), 
must have the circle | z | = 2n as its circle of convergence, for the de- 
nominator of the function vanishes for z = 2ni but (apart from the origin) 
at no point interior to the circle | z | < 27. 


ExaMPi3 


Prove, without using the theory of power series directly, that the 
derivative of an analytic function is differentiable, by successive differen- 
tiation under the integral sign in Cauchy’s formula and justification of the 
validity of this process. 


3. The Theory of Functions and Potential Theory. 


From the fact that analytic functions may be differentiated 
as often as we please it also follows that the functions u(x, y) 
and (2, y) have continuous derivatives of any order. We may 
therefore differentiate the Cauchy-Riemann equations. If we 
differentiate the first equation with respect to # and the second 
with respect to y and add, we have 


AU = Ugg + Uyy = 0; 


in the same way, the imaginary part v satisfies the same equation 
Av = Vag + Vyy = 9. 


In other words, the real part and the imaginary part of an analytic 
function are potential functions. 
If two potential functions u, v satisfy the Cauchy-Riemann 
equations, v is said to be conjugate to u, and —u conjugate to v. 
We accordingly find that the theory of functions of a complex 
variable and potential theory in two dimensions are essentially 
equivalent to one another. 


VIII] CAUCHY’S FORMULA 551 


ExaMPLE 


Show that for every potential function u it is possible to construct a 
conjugate function v and to determine it uniquely apart from an additive 
constant. 


4. The Converse of Cauchy’s Theorem. 


As a further deduction we have the converse of Cauchy’s 
theorem: 

If the continuous function £ = u + iv = f(z) is such that its 
integral round every closed curve C in its region of definition R 
vanishes, then f(z) is an analytic function in R. 

To prove this we note that in any case, by § 3, p. 539, the 


integral [i f@d taken along any path joining a fixed point 4 


and a variable point z is a differentiable function F(z), where 
F(z) = f(z). F(z) is therefore analytic, and by our result above 
8o is its derivative F’(z) = f(z). 

This converse of Cauchy’s theorem shows that the postulate 
of differentiability could have been replaced by the postulate 
of integrability. The equivalence of these two postulates is a 
very characteristic feature of the theory of functions of a complex 
variable. 


5. Zeros, Poles, and Residues of an Analytic Function. 


If the function f(z) vanishes at the point z= 2, the constant 
term in the Taylor series of the function in powers of z— 2, 


F (2) =f Go) + (2 — 2) Fo) + - 


vanishes, and possibly further terms of the series vanish in 
addition. A factor (2 — z))" may then be taken out of the power 
series and we may write 


F(2) = @ — %)"9(2), 


where g(%)) +: 0. A point 2, for which this occurs is said to be a 
zero of the function f(z) of the n-th order. 

The reciprocal 1/f(z) = q(z) of an analytic function, as we 
saw above, is also analytic, except at the points where f(z) vanishes. 
If 2 is a zero of f(z) of the n-th order, the function g(z) can be 
represented in the neighbourhood of the point z in the form 
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where A(z) is analytic in the neighbourhood of z= 2%. At the 
point z = 2, the function 9(z) ceases to be analytic. We call this 
point a singularity (singular point), in this particular case a pole 
of the function q(z) of the n-th order. If we think of the function 
h(z) as expanded in powers of (2— 2%) and then divided by 
(2 — 2)" term by term, in the neighbourhood of the pole we 
obtain an expansion of the form 


(2) = CZ — Zp). « Og (%@— %)* + Cg t+ G(Z— 2%) +--+, 


where the coefficients of the powers of (z— %) are denoted by 


Ohigig- ise tg Cogy Cg, Cig eee 
If we are dealing with a pole of the first order, i.e. if n= 1, 
we obtain the coefficient c_, immediately from the relation 


o- = lim (2— %)q(2). 
> Zo 


Since 
1 _ f@ _ feO—f) 
q(z)(@— %) %—% %— % ; 
we have 
C4 = 1 ‘ 
f'(@) 


In the same way, if q(z) = r(z)/d(z), and ¢(z) has a zero of 
the first order at z= %, while 7(z) ++ 0, we have 


__ 1(%) 


6G) 


If a function is defined and analytic everywhere in the 
neighbourhood of a point 2, but not at the point itself, its 
integral round a complete circle enclosing the point 2, will in 
general not be zero. By Cauchy’s theorem, however, the integral 
is independent of the radius of this circle and in general has the 
same value for all closed curves C which form the boundary of 
a sufficiently small region enclosing the point 2. The value of 
the integral taken round the point in the positive sense is called 
the residue at the point. 

If the singularity is a pole of the n-th order and if we integrate 
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the expansion of the function, the integral of the series with 
positive indices is zero, as this power series is still analytic at the 
point 2%. 

When integrated the term c_,(z— 2%)! gives the value 2sic_1, 
while the terms with higher negative indices give zero, for the 
indefinite integral of (z — z,)~” for v > 1 is (2 — z)"4/(1 — »), 
as in the real case, so that the integral round a closed curve 
vanishes. 

Lhe residue of a function at a pole is therefore 2mic_,. 

In the next section we shall become acquainted with the 
usefulness of this idea as expressed by the following theorem: 

Theorem of Residues. If the function f(z) is analytic in the in- 
terior of a region R and on its boundary C, except at a finite number 
of poles, the integral of the function taken round C in the positive 
sense is equal to the sum of the residues of the function at the poles 
enclosed by the boundary C. 

The proof follows at once from the statements above. 


EXAMPLES 
1*, Show that the function 
1 nm 
fl2)— (8) 2 dt, 


Ini J C— zen 
where the integral is taken round a simple contour enclosing the points 
¢ = 0 and ¢ = z, is a polynomial g(z) of degree n — 1 such that 
g™(0) = fo (0) for m=0,1,...,n—1. 
2. Let f(z) be analytic for|z| <p. If M is the maximum of | f(z) | on 
the circle | z| = p, then the coefficients of the power series for f, 


a 
f (2) = Oe 
ve 
satisfy the inequality 
ja, <% 
e” 


3*. Prove that if a region is bounded by a single closed curve C, and if 
f(z) is analytic in the interior of C and on C and does not vanish on C, then 
1 fea 
2rt Jo f(z) 
is the number of zeros of f in the interior of C. 


4, (a) Two polynomials P(z) and Q(z) are such that at every point on 
w certain closed contour C 
[ Q2)| <| P@)|. 


198 (8912) 
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Prove that the equations P(z) = 0 and P(z) + Q(z) = 0 have the same 
numbers of roots within C. (Consider the family of functions P(z) + 6Q(z), 
where the parameter 8 varies from 0 to 1.) 

(6) Prove that all the roots of the equation 


2+taz+1=0 

lie within the circle | z| = r if 
jaj<rr—}, 
r 


5. If f(z) = 0 has one simple root @ within a closed curve C, prove that 


this root is given by 
fq 
a= =i 5G) 


5. APPLICATIONS TO ComMPLEX INTEGRATION (CONTOUR 
INTEGRATION) 


Cauchy’s theorem and the theorem of residues frequently 
enable us to evaluate real definite integrals by regarding these as 
integrals along the real axis of a complex plane and then simpli- 
fying the argument by suitable modification of the path of in- 
tegration. In this way we sometimes obtain surprisingly elegant 
evaluations of apparentiy complicated definite integrals, without 
necessarily being able to calculate the corresponding indefinite 
integrals. We shall discuss some typical examples. 


1. Proof of the Formula 


f me ds =. 
9 x 2 


Here we give the following instructive proof of this important 
formula, which we have already discussed by other methods 
(Vol. I, pp. 251, 418, 450; Vol. II, p. 315). 

We integrate the function e*/z in the complex z-plane along 
the path C shown in fig. 7, which consists of a semicircle H, of 
radius R, a semicircle H, of radius 7, both having their centres 
at the origin, and the two symmetrical intervals I, and I, of 
the real axis. Since the function e**/z is regular in the circular 
ting enclosed by these boundaries, the value of the integral in 
question is zero. Combining the integrals along J, and J,, we 


have 
f Sat Sar sine 4 
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We now let F tend to infinity. Then the integral along the semi- 
circle H, tends to zero. For if we put z= R(cos@ + 4 sin 6)= Re® 


(hes - 
or R 
Fig. 7 


for points on the semicircle, we have e# = eRe%e~Rein? ang 
the integral becomes ¢/ e®*%~**in?dg The absolute value 


0 
of the factor e® =? is 1, while the absolute value of the factor 
e—Rsin? is less than 1 and, moreover, tends uniformly to zero as 
R tends to infinity, in every interval eX @<37—e. Hence 
it follows at once that the integral along H, tends to zero as 
+o, As the reader can easily prove for himself, the integral 
along the semicircle 1, tends to —7i as r —> 0. The integral along 
the two symmetrical intervals I,, I, of the real axis tends to 


24 [= anz dz as R->o and r+>0,. Combining these statements, 


we ienedately obtain the relation given above. 


2. Proof of the Formula 
[oa] 
f cosaxe dx = 1/ne-t, 
0 


We have already proved this formula in Chap. IV (Ex. 4a, 
p. 318), but we shall now obtain it by means of Cauchy’s 
theorem. 

We integrate the expression e~** along a rectangle ABB’A’ 
(fig. 8), in which the length of the vertical sides 44’, BB’ is a/2, 
and that of the horizontal sides 4B, A’B’ is 2R. This integral 
has the value zero, by Cauchy’s theorem. On the vertical sides 
we have |e~* |= |e~@*—%e~2#y | — eo Me" <e Mel", and this 
expression tends uniformly to zero as # tends to infinity. Thus 
the portions of the whole integral which arise from the vertical 
sides tend to zero, and if we carry out the passage to the limit 
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R-+o and note that on A’B’ dz= d(x -+ 41a) = dz, we may 


express the result of Cauchy’s theorem as follows: 
[i ecette de =f" e*de. 


—2 


That is, we can displace the path of integration of the infinite 
integral parallel to itself. By our previous result * (p. 262) the 


A’ B' 
ee ie 
A 7) B me 


Fig. 8 


value of the integral on the right is 1/7. The integral on the left 
immediately becomes 


cy be] 
ei al e-*"(cosax — isinax)dx = 2e** f cosaze~*'da, 
0 


~@ 


if we remember that sinaz is an odd function and cosaz an even 
function. This proves the formula. 


3. Application of the Theorem of Residues to the Integration of 
Rational Functions. 


If in the rational function 


Uy 4S 
Se) = bo + yz +... + 5,2" 


the denominator has no real zeros and its degree exceeds that of 
the numerator by at least two, the integral 


I=f “Q2) dx 


can be evaluated in the following way. 

We begin by taking the integral along a contour consisting of 
the boundary of a semicircle H of radius R (on which z = Re’, 
06S 7), where R is chosen so large that no pole of Q(z) lies 
on or outside the circumference of the circle, and the real axis 
from —R to-+R. Then on the one hand the integral is equal 


* Cf. also sub-section 6, p. 561. 
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to the sum of the residues of Q(z) within the semicircle, while 
on the other hand it is equal to the integral 


1,=f Qe)de 


plus the integral along the semicircle H. By our assumptions, 
a fixed positive constant M exists such that for sufficiently 
large values of R we have * 


1Q2)|< = 


The length of the circumference of the semicircle is 7R. By our 
estimation formula on p. 539, the integral along the semicircle 


is therefore less in absolute value than 7R oa _ sd , and hence 
tends to zero as R-»> ©. This means that the integral 


= f ) dx 


is equal to the sum of the residues of Q(z) in the upper half-plane, 
We now apply this principle to some interesting special cases. 


We begin by taking 


Qe) u 


1 
aa + be+te oe fe 
where the coefficients @, b, c are real and satisfy the conditions a > 0, 
b? — 4dac < 0. Then the function Q(z) has only one simple pole 


2=4= = {—b + tv (4ac — b4)}, 


where the square root is to be taken positive, in the upper half-plane. By 
the general rule (p. 553), therefore, the residue is 2xt Fey Since 

f(a) = 2az, + b = tv (4ac — 5), 
we have 


be 1 gies 20 
A ee reer ~ V (4ac — BY 


* This follows immediately from the fact that Q(z) = 3, RG), where R(z) 


tends to zero as z—> @ (when x > m + 2) or to dm/by (when n = m + 2). 
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As a second example we shall prove the formula (cf. Vol. I, p. 234) 


+ 
f © eae 
~o 1+2 


Here again we can immediately apply our general principle. In the 
upper half-plane the function 1/(1 + 2‘) = 1/f(z) has the two poles 
z = e = ett, z, = —e— (the two fourth roots of —1 which have a posi- 
tive imaginary part). The sum of the residues is 


fol a cere cee ae ere 
ani {is + yay} = 2g (A+ 3)= Fee 2), 


~ ce 8 2 
= —ni.isin SS = maint = dnv2, 


as was asserted. 
ExaMPies 
1, Prove the formula 


, ede a dae 
a oor 


in the same way as above. 
2. Prove that in general if n and m are positive integers and x > m, 


‘a on de = ®oin(™+! 5), 
—»#l+2 n Qn 


The following proof of the formula 
re dx __ aw (2n)! 


=o (1 a%)rtt 4 (nl)? 


exemplifies the case where the residue at a pole of higher order 
has to be calculated. 

If we replace x by z, the denominator of the integrand is of 
the form (z +- +)"+4(z — 7)"*+4, and the integrand accordingly has 
a pole of the (n + 1)-th order at the point z= ++. To find the 
residue at that point we write 


1 1 1 1 


(+i f@) (2-4) (20+ 2— i) 


1 1 z2—7\-e-1 
- pommel ')™ 
If we expand the last factor by the binomial theorem, the term 
in (z — 7)" has the coefficient 
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1 (—m—-W_ 1 yy +1)... 20 _ ™ (2m)! 
ap n )= a Tae alle 


The coefficient c_, in the series for the integrand in the neigh- 
_1 1 (n)! 
Q2n+1 4 (m})2 
which proves the 


bourhood of the point z= 4 is therefore equal to 
am (2n)! 


The residue 27ic_, is therefore 928 (nl)? 


formula. 


As a further exercise the reader may prove for himself by the theory 
of residues that 


a) ; 
“sna 
ee dx = dre—lel 


~»@+¢ 
(replacing sin x by e?*). 
ExaMPLE 
Let f(z) be a polynomial of degree n with the simple roots 01, Og)... 5 %q 
Prove that 
5 0 (ba G1 2) 
= = =U, 1,...,%— &). 
27a)" ° 
fe 
(Consider f a dz round a closed curve enclosing all the «,’s.) 


4. The Theorem of Residues and Linear Differential Equations 
with Constant Coefficients. 
If 
By + Az + age? +... + Oq2" = Plz) 


is a polynomial of the m-th degree, and ¢ a real parameter, we 
think of the integral 
et*f(2) 
u(t) = dz, 
Oe: Pe) 


taken along any closed path C in the z-plane, which does not 
pass through any of the zeros of P(z), as a function u(¢) of the 
parameter ¢. Let f(z) be a constant or any polynomial in z, of a 
degree which we shall assume to be less than n. By the rules 
for differentiation under the integral sign, which hold unaltered 
forthe complex region, we can differentiate the expression u(t) once 
or repeatedly with respect tot. This differentiation with respect 
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tot under the integral sign is equivalent to multiplication of the 
integrand by 2, 2?, 23,..., as the case may be. If we now form 
the differential expression L[u] = ayu-+ a,w’ + au’ +...+a,u™, 
or, i symbolic notation, P(D)u, where D denotes the symbol] 
of differentiation D = d/dt, we have 


P(Dyu = Lu] = f et* f(z) dz, 


By Cauchy’s theorem the value of the complex integral on 
the right is zero; ie. the function u(t) is a solution of the dif- 
ferential equation L[u] = 0. If f(z) is any polynomial of the 
(n — 1)-th degree, this solution contains » arbitrary constants. 
We may accordingly expect to get in this way the most general 
solution of the linear differential equation with constant co- 
efficients, L[u] = 0. 

Tn fact we do obtain the solutions in the form which we 
already know (cf. Chap. VI, § 4, p. 449), on evaluating the 
integral by the theory of residues, with the assumption that the 
curve C encloses all the zeros 2%, %,..., 2, of the denominator 
P(z) = ay(2 — %)(2 — %)...(@— 2n). Tf we assume to begin 
with that all these zeros are simple zeros, they are simple 
poles of the integrand, and the residue at the point a, is 


oni 5) ot By suitable choice of the polynomial f(z) the 


expressions f(z,)/P’(z,) can be made arbitrary constants; we 
accordingly obtain the solution in the form 


u(t) = X ce, 
vel 
in agreement with our previous results. 

If a zero z, of the polynomial P(z) is multiple, say r-fold, so 
that the corresponding pole of the integrand is of the r-th order, 
the residue at the point z, must be determined by imagining 
the numerator e¢**f(z) = etve“*-*) f(z) also expanded in powers 
of z—z,. We leave it to the reader to show that the residue at 
the point z, gives the solutions te, ... , ¢”—4e' as well as the 
solution e*», 
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6. Proof of the Formula 
fievdx = /h. 


In evaluating the integral on p. 555 we took over this formula 
as known from the theory of real variables. It is, however, 
possible to obtain the result by complex integration, using the 
theory of residues. As this proof is very instructive, we shall give 
it here, although from our elementary point of view its starting- 
point may appear artificial. We begin with a complex integral 
which arises in other branches of mathematics (e.g. the theory of 
numbers). 

We use the symbol /} to denote the straight line z = } + pe" 
(—% < p<) in the zplane, that is, a straight line making 
an angle of 45° with the z-axis and cutting it at the point 3. 
The symbol /—} or /0 will bear a similar meaning. Let u be 
a real parameter. We then consider the integral . 

ertzt + 2riuz 
f (u) = it gunz —1 


This integral is to be regarded as an improper integral, that is, 
we integrate in the first place between the limits p= —R, 
p= R, and then let R tend to infinity. The reader may verify 
that this integral exists by means of an argument following the 
pattern of similar arguments for real integrals. Then 


fut Y—-fw=f sey ole — Vide 
ao: i emia? +2riuz J 
i 
a= eu" ff grilz +4) Jo 
Ai 


As the integrand on the right is regular everywhere, we can 
use Cauchy’s theorem to displace the path of integration 
parallel to itself to any extent, as on p. 556, writing, for 
example, 


f(u+1)—fu) = oot f erie de == eT Mn] 
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where z = pe’”'* on the path of integration and hence 
T= él [e-'dp. 
Lee 
That is, if we substitute Vp = t, we have 
iif Lf? met 
I= e*_ fem" dt. 
VA 7 x 


Again, if we put z= A+ 1 and take A as the new variable 
of integration, we obtain the expression 


gmiM+ Orie 
fM=—f way emernnaa, 
jsgeer tl 
using the facts that ¢? = 1, e* = —1, or 
int Qaida 
—e~27i4 f(y) 5 erin Reid g) I ae 7 o 


By the above result, as we can again displace the path of integra- 
tion parallel to itself, the first integral on the right is equal to 
e—*™"T, If we replace the second integral by the integral obtained 
for f(u) by displacing the path of integration through an interval 
1 to the right, we have te note that the pole A= 0 of the 
integrand lies between the two paths of integration. 

We now apply the theorem of residues—the fact that the 
path of integration /-4 and /$ extends to infinity gives us no 
trouble, in virtue of the analogous discussion on p. 556—prove 
that the residue of the integrand at the point A= 0 has the 
value 1, and then at once obtain the result 


—f (uen n= eT 4 flu) — 1 


from our equation. Here neither 7 nor the function f(u) is ex- 
plicitly known. If, however, we put u = 4, f(u) disappears from 
the equation, and we are left with 

e747 = I, 
But since 


Pee ot fleas, 
TT 4 —x% 


the real integral formula follows at once. 
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6. MAny-VALUED FuNcTIONS AND ANALYTIC EXTENSION 


In defining functions both real and complex we have hitherto 
always adopted the point of view that for each value of the 
independent variable the value of the function must be unique. 
Even Cauchy’s theorem, for example, is based on the assumption 
that the function can be defined uniquely in the region under 
consideration. All the same, many-valuedness often arises of 
necessity in the actual construction of functions, e.g. in finding 
the inverse of a unique function such as the n-th power. In the 
real case we separated different one-valued branches of the inverse 
function in inversion processes such as +/z or v/z. We shall 
see, however, that in the complex case this separation is no 
longer possible, for the various one-valued branches are now 
interconnected. 

We must be content here with a very simple discussion based 
on typical examples. 


For instance, we shall consider the inverse €==</z of the function 
z= 03. To one value of z there correspond the two possible solutions ¢ and 
—C of the equation z=¢?. These two branches of the function are con- 
nected in the following way. Let z= re®, If we then put  =/rei#!2 = f(z), 
¢ = f(z) is certainly analytic in every simply-connected region R ex- 
cluding the origin (where f(z) is no longer differentiable). In such a region 
¢ is uniquely defined, by our previous statement. If, however, we let the 
point z move round the origin on a concentric circle XK, say in the positive 
direction, = ~/reé@/2 will vary continuously; the angle 6, however, 
will not return to its original value, but will be increased by 2x. Hence 
in this continuous extension when we come back to the point z we no 
longer have the initial value © =+/rei#/2, but the value+/re#/2 ¢2ni/2 — 1, 
We say that when it is continuously extended on the closed curve K 
the function f(z) is not unique. 

The function a/2, where » is an integer, exhibits exactly the same 
behaviour. Here every revolution multiplies the value of the function by 
the n-th root of unity, namely ¢ = e?7/", and the function only returns 
to its original value after n revolutions. 

In the case of the function logz we saw (p. 543) that there is a similar 
many-valuedness, in that in travelling once continuously round the origin 
in the positive sense the value of logz is increased by 2ni. 

Again, the function z* is multiplied by e2" per revolution. 


All these functions, although in the first instance uniquely 
defined in a region R, are found to be many-valued when we 
extend them continuously (as analytic functions) and return to 
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the starting-point by a certain closed path. This phenomenon 
of many-valuedness and the associated general theory of analytic 
extension cannot be investigated in greater detail within the 
limits of this book. We would merely point out that the unique- 
ness of the values of a function can theoretically be ensured by 
drawing certain lines in the z-plane which the path traced by z 
is not allowed to cross, or, as we say, by making cuts along cer- 
tain lines. These cuts are so arranged that closed paths in the 
plane which lead to many-valuedness are no longer possible. 


For example, the function logz is made one-valued by cutting the 
2-plane along the negative real axis. The same applies to the function Vz. 
The function V(1 — 22) becomes one-valued if we make a cut along the 
real axis between —1 and +1. 


Once the plane has been cut in this way, Cauchy’s theorem 
can at once be applied to these functions. 

We now give a simple example showing how Cauchy’s theorem 
is applied in a case where many-vulued functions arise, by 
proving the formula 


+1 1 - Qa 
: =f (@@— b)h/0 — ®) il (1) 


where & is a constant which does not lie on the real axis between 
—1 and +1. 


We begin by noting that the function : 


@—hya—* * 
one-valued in the z-plane provided we make a cut along the real 
axis from —1 to +1. If in the complex plane we approach this 
cut § first from above and then from below, we obtain equal and 
opposite values for the square root +/(1 — 2"), say positive from 
above and negative from below. We now take the complex 
integral 


f dz 
o (2— k)/(1 — 2) 


along a path C as indicated in fig. 9. By Cauchy’s theorem we 
can make this path contract round the cut without altering the 
value of the integral. The integral is therefore equal to the 
limiting value obtained when this contraction is made, which is 
obviously equal to 27. On the other hand, if we take the integral 
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of the same integrand along the circumference of a circle K with 
radius R and centre the origin, this integral, by our previous 


Fig. 9 


investigations, tends to zero* as R increases. By the theorem of 

residues, however, the sum of the integrals along C and K is 
equal to the residue of the integrand at the enclosed pole z = k; 

hence 2J is equal to the residue in question. This residue is 


1 I 2m 
ak Vi—-A@—h Ve—1) 
which proves our statement. 

Example of Analytic Extension. The Gamma Function.—In 
conclusion we give yet another example showing how an analytic 
function, originally defined in a part of the plane only, can be 
extended beyond the original region of definition. We shall 
extend the gamma function, which was defined for «> 1 by 
the equation 


T(z) = is “pote di, 


analytically for z <1 also. We can do this e.g. by means of the 
functional equation Pe) == Te+ 1), using this equation to 


define ['(z — 1) when I'(z) is known. By means of this equation 
we can imagine I(z) as extended first in the parallel strip 


*In fact, its value is actually zero, since by Cauchy’s theorem it is 
independent of the radius R, provided that the circle encloses the pole z = &. 
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—1<2<0 and subsequently extended to the next parallel 
strip —2 <x < —1, and so on. 

We can, however, adopt another method, of greater theoretical 
interest, for extending the gamma function. We consider the 
path C in the ¢t-plane indicated in fig. 10, which surrounds the 


SSS == 


Fig. 10.—Loop-integral for the gamma function 


positive real axis of the ¢-plane and approaches this axis asymp- 
totically on either side. We easily see from Cauchy’s theorem that 
the value of the “ loop-integral ”,* 


f t@-le-t dt, 
o 


is unaltered when the loop is made to contract into the 2-axis. 
The integrand ¢*-4e-' then tends tc different values as we 
approach the z-axis from above and below, the values differing 
by the factor e?"*. For x > 0 we thus obtain the formula 


(1 — &#)P(z) = f{ tet dt, 


This formula is deduced subject to the assumption that 2, the 
real part of z, is positive. We see now, however, that the loop- 
integral has a meaning, no matter what the complex number z 
is, since it avoids the origin f= 0. This loop-integral therefore 
represents a function which is defined throughout the z-plane. 
We then define this function by stating that it is equal to 
(1 — e"*)I'(z) throughout the z-plane. The gamma function 
has thus been analytically extended to the whole of the z-plane, 
except the points <0 for which the factor (1 — e””*) vanishes, 
that is, except the points z= 0, z= —1, z= —2, and so on. 

For more detailed and more extensive investigations the 
reader must be referred to the literature of the theory of func- 
tions.f 

* This is again an improper integral, which arises by a passage to a limit 
from an integral along a finite portion of C. The reader may satisfy himself 
that it exists, by an argument similar to chose previously employed. 

{ E.g. MacRobert, Functions of a Complex Variable (Macmillan); Whittaker 


and Watson, Modern Analysis (Cambridge University Press); Watson, Complex 
Integration and Cauchy's Theorem (Cambridge Tracta, No. 15). 
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Misontitanzous Exampues VIII 


1. Write down the condition that three points 2,, 22, 7, may lie in a 
straight line. 

2*. Write down the condition that four points 2), 22, 23, 2, may lie on a 
circle. 

3*. Let A, B, C, Din the z-plane be four points in order on the circum- 
ference of a circle, with co-ordinates 2,, 2, 25, 2s Using these complex 
co-ordinates, show that A4B.CD -+- BC. AD = AC. BD. 


4. Prove that the equation cosz = ¢ can be solved for all values of c. 
5. For which values of ¢ has the equation tanz = ¢ no solution? 

6. For which values of z is (a) cosz, (b) sinz real? 

7. Find the radius of convergence of the power series Za,z", where 


(a) a, = - s being a complex number with a positive real part; 
n 
(3) a, = 05 
(c) a = logn. 
8. Prove the formula 


wm tin (142) 


no 
where z is complex. 
§. Evaluate the integrals 


cosz cos 2 
OL Tee @ f° ee ae, © f° ara® 


goat 


(0 Et 1)(a + 2) 


(a) dziorl<a<2 


by complex integration. 
10. Find the poles and residues of the functions 


ane ae T(z), cotz = piste 
sinz cosz sing 
11*, Find the limiting value of the integral 
cot nt 
o, §— 2% 


as n —> 0, where the path of integration is a square C, with its sides parallel 
to the axes at a distance n + 4 from the origin. Hence, using the theorem 
of residues, obtain the expression for cot xz in partial fractions. 


12*, Using the equation 
dt 
log(1 + 2) = iL aaee 
ive 
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show that the power series for log(1 + 2) converges everywhere on the 
unit circle |z|= 1, except at the point z=—1. By equating the 
imaginary part of the series to the imaginary part of log(1 + e), 
establish the truth of the Fourier series (cf. Vol. I, p. 440) 

$0 = sind — }sin20 + 4sin30—...(—m < 0 < =). 


13*. (a) Prove that the series 


(—1y# 
ye 


fle) = fl2+ iy) = z 


converges for z > 0. 

(6) Prove that this series provides an extension of the zeta function 
(defined in Ex. 2, p. 545) to values of z such chat 0 < a <1, by means 
of the formula 

f(z) = (1 — 2**) (2), 


which is valid for x > 1. 
(c) Prove that the zeta function has a pole of residue 1 at z= 1. 


SUPPLEMENT 


Beat Numpers anp THE Concept or Limit 


In Vol. I, Chapter I, it was taken for granted that the rea] 
numbers form an aggregate within which the ordinary operations 
of arithmetic may be performed as with the tational numbers. 
We shall investigate this assumption more closely here. We 
take the arithmetical operations on the rational numbers as given. 
Our object is then to make an abstract analytical extension of 
the class of rational numbers which shall yield the wider class of 
real numbers, and to do this without relying on intuition in 
our proofs. We must frame our definitions in such a way that, 
as a logical consequence of them, the ordinary rules of arithmetic 
apply to all real numbers just as they do to rational numbers. 

The introduction of irrational numbers will be undertaken in 
close conjunction with a thorough consideration of the concept 
of limit, in which we shall repeat in a revised form the discussion 
of Vol. I, Chapter I, Appendix (p. 58 et seq.).* 


1. Definition of the Real Numbers by means of Nests of 
Intervals. 


The irrational numbers and, in general, the real numbers 
were defined in Vol. I, Chapter I, § 1, p. 8, by means of decimals, 
the rational numbers being represented by terminating or 
recurring decimals. By such a decimal, say a= O-aaoq5.- - , 
we mean that the number represented, called a, lies between 
the rational number a, = 0-a,...4@, and the rational number 
dn + 10-*. The number a is thus determined by means of a 
sequence or nest of progressively smaller and smaller intervals, 
each inside the previous one, the n-th interval being of length 
10-*. 

*The only difference in the point of view will be that here we shall start 
with the logical abstract concept of real numbers, while on the former occasion 


the properties of real numbers were taken for granted. 
569 
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For our present purpose it would be inconvenient to restrict 
ourselves to special nests of intervals where the length of the 
n-th interval is 10-". We begin with the following general defini- 
tion. 

By a rational interval (a|b) we mean the aggregate of all the 
rational numbers x which satisfy the inequalities aS a2< b, 
where a <b and a and 8 are rational numbers. The number 
(b — a) is called the length of the interval. We say that the 
interval (c|d) is contained in the interval (a|d) if a<c<d<b. 
An infinite sequence of rational intervals (a,|},), (ag|0,), .. . is 
called a nest of intervals if every interval (a,|b,) contains the 
next in order, (@n4,|0n+1), and the lengths 6, — a, tend to zero. 
That is, given any positive number e, however small (the number 
€ must, of course, be rational, since no other numbers have as yet 
been introduced), there is a number M(e) such that the lengths 
b, — a, are less than ¢ for all suffixes n which exceed N. 

From the intuitive meaning of a nest of intervals, and re- 
membering in particular how we may pick out any point on the 
number axis by means of a nest of intervals, as on p. 9 of Vol. I, 
we arrive at the idea that we may define an arbitrary real number 
by a nest of intervals. This is to be taken as meaning the following: 
the real number is given by an unending process of approxima- 
tion which is determined by the nest of intervals. The nest whose 
general member is (a,|0,) gives us, with regard to the number 
a to be defined, the fact that this real number lies between a, 
and b,; again, it lies between a, and b,, between a, and b;, and 
soon. The nest of intervals will thus give us two rational num- 
bers, as near together as we please, between which the real 
number lies. 

The essential step is now that we abandon the notion of 
obtaining an objective definition of the irrational numbers. We 
give up the attempt to characterize the irrational numbers as 
given mathematical entities with specific properties. We do not 
say that an irrational number 7s such and such a mathematical 
object; instead, we are content with the process of approximation 
which gives the nest of intervals and regard each such process 
as defining a real number. If there is a rational number @ con- 
tained in all the intervals (a,|b,), the real number defined by the 
nest of intervals (a,|b,) is said to be identical with a. By this 
assumption the rational numbers become real numbers also. 
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The words irrational number or, more generally, real number 
may thus be regarded merely as a brief way of referring to a 
nest of intervals.* 

This is what is meant by the statement that an irrational 
number is given or defined by a nest of intervals. In practice it 
comes to this, that every operation with real numbers is an 
operation with nests of intervals. This offers the possibility of 
making calculations with real numbers depend logically on 
operations with rational numbers. 

Tt is necessary to lay down a procedure for defining addition, 
multiplication, &c., of real numbers by nests of intervals. Here 
the rules must be framed in such a way that the ordinary laws 
of calculation still apply. Moreover, we must ensure that the 
rules of calculation with rational numbers are not contradicted. 

We shall begin by showing that our definition implies an 
ordering of the real numbers by magnitude. This in itself provides 
a sufficient groundwork for the axiomatic foundation of the 
concept of limit and a more thorough understanding of it. When 
this has been achieved, we shall return to the question of the 
tules of calculation with real numbers. 


2. The Real Numbers in Order of Magnitude. 


Let two numbers a and y be given by nests of intervals 
(ay,| bn) = %, and (c,|d@,)= Jn The following three cases may 
oceur. 

(1) From a certain stage n = m) onward every interval j, lies 
to the right of the interval 7,; that is, for n= no, and of course 
for every n> No, we have b,<c,. We then say that y is greater 
than a, or y > a. 

(2) If, on the other hand, from a certain n, onward 4, lies to 
the right of j,,, then we say that a > y. In this case for n = ny 
we have always d, < Gy. 

(3) Neither of the above situations arises. We then say that 
the two nests of intervals 7, and j, define the same number: 
a=y. Thus two nests of intervals define the same number if, 
and only if, the intervals 7, and j, always overlap; that is, if 

* Some process of this kind is often essential in giving a precise formulation 
to mathematical concepts. For instance, in projective geometry, when points 
at infinity are introduced these points are not treated as definite mathematical 


entities in themselves; we merely say that a point at infinity is given by a 
pencil of parallel lines, 
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both a, S d, and 6, = c,; or if the two intervals 4, and j, have 
rational points in common for every n. A special consequence 
of this definition is that if, of two nests of intervals, one is obtained 
from the other by the omission of a finite or infinite number of 
constituent intervals, the two nests define the same real number. 

All these rules giving the magnitude relations between two 
real numbers can be understood immediately from the point of 
view of the intuitive meaning of nests of intervals, 

A few simple facts about inequalities between real numbers 
will now be noticed. They will be of use in what follows. 

We first make the following observation. The relation aS y 
can be inferred from the two defining nests of intervals (a,|b,) 
for a and (c,|d,) for y if we note that from a certain n= ny 
onwards the inequality a, < d, holds.* 

In just the same way we see that the condition c, < b, for 
all large values of n is equivalent to a => y. 


We see at once from the above that if « is a real number determined 
by the nest of intervals (a,,|b,,), then a, Sa<b,. This fact justifies 
our rule, for it shows that any real number is actually contained in every 
interval of the nest which defines it. 


Tf a and B are two real numbers and a < , then by the in- 
terval (a|8) is meant the aggregate of all real numbers ¢ such 
that a €< B. We call the interval a rational interval if its 
“end-points ” a and f are rational numbers. We say that the 
real number € lies in the interior of the interval if the signs of 
equality are absent, so that a< £< f. We describe (a|f) as 
a neighbourhood of the real number y if y lies in the interior of 
(al). 


Every interval has rational numbers r in its interior. 


For let (a,,[6,) and (c,|d,,) be nests of intervals defining the numbers 
wand 8. Since a < §, there is a number m, such that b,, <¢,. Thus 
& SS Ong <Cng SB. We see that r = (by, + ¢,,) /2 is a number with the 
required property. 


From this we obtain the following statement: if (a|f) is a 
neighbourhood of y, then (a|8) contains a rational neighbour- 


*For if a = y this inequality is satisfied, as can be seen from the definition 
of equality, and if a < y then from some number onwards we have by < ¢y, 80 
that a fortiori dy <d,. Conversely, if from some number onwards ay < dy, 
then either b, > cy, for all such values of n, and then a = y by definition, or 
else, for some value of n, by < cy, which gives a < y. 
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hood (a|6) of y. It is only necessary to choose two rational 
numbers a and 6 such thata<a<y<b< 8. It is also easy 
to see that if a < , then rational neighbourhoods (a|b) of a 
and (c{d) of B can be found such that b<c; in other words, 
the two neighbourhoods have no points in common. 

We shall not deal with the fundamental rules of calculation 
until we come to sub-section 8, p. 580. Our next step is to 
resume the analysis of the concept of limit with the help of the 
ideas just explained. 


3. The Principle of the Point of Accumulation.* 


The determination of real numbers by nests of intervals 
forms the essential basis of the proof of the principle of the 
point of accumulation, which is due to Weierstrass. A few 
remarks on the concept of the point of accumulation will first 
be made. 

Let M be an infinite set of real numbers in which it is per- 
missible for the same number to occur more than once, and 
indeed an infinity of times. (For example, 1, 1, 1, ... is such a 
set.) If & is a number such that every neighbourhood of £ contains 
an infinity of numbers belonging to M, then € is called a point of 
accumulation of the set M. The name of course recalls the geo- 
metrical connexion between numbers and points. Since every 
neighbourhood of ¢ contains a rational neighbourhood, it is 
sufficient to formulate the above requirement in terms of rational 
neighbourhoods only. 

An infinite set of numbers need not necessarily have a point 
of accumulation. The set of integers provides an example. A 
point of accumulation of a set need not itself be a member of 
the set. For example, the set 1, 3,3, ...,1/m,... has 0 as a point 
of accumulation, but the definition of the set shows that 0 is not 
one of its members. A set which contains all its pots of accumu- 
lation is said to be closed. The set of all numbers a such that 
0 < x < 25 is not closed, since the points of accumulation 0 and 
25 do not belong to it. On the other hand, 0 S 2 < 25 defines 
a closed set. A set a< 2X bis called a closed interval. 

A set may have an infinity of points of accumulation. For 
example, every real number is a point of accumulation of the 


*The above discussion is essentially a repetition of the text in Vol. I, 
Chap. I, p. 58. The same is true of the next three sub-sections. 
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set of rational numbers. For if a is any real number, which may 
be thought of as given by a nest of intervals (4,,|5,), then every 
neighbourhood of a contains an infinity of intervals (a,,|b,), and 
hence of rational numbers a,, },,. 

The principle of the point of accumulation, which will now 
be proved, runs as follows: 

Every bounded infinite set of real numbers, that is, every infinite 
set of real numbers lying in a definite interval, possesses at least one 
point of accumulation. 

To prove this we have to construct a nest of intervals defining 
a real number which has the property of a point of accumulation 
of the set. 

We first observe that it is legitimate to assume that the given 
set is contained in a rational interval; for if this were not the 
case we could replace the given interval by a larger interval with 
rational end-points. We now divide this rational interval into 
two equal sub-intervals. At least one of these contains an infinite 
number of points of the set. For if this is not the case the original 
interval contains only a finite number of points of the set, and 
the hypothesis is contradicted. We take the sub-interval con- 
taining an infinite number of points of the set, or, if such occur 
in both, we take one or other of the sub-intervals, and divide it 
into two equal sub-intervals. Just as before, at least one of 
these sub-intervals contains an infinity of points of the set. 
Hither this one, or one of the two containing an infinity of points 
of the set, is now sub-divided, and so on. In this way a nest of 
intervals (a,|b,) is constructed; for each interval taken is con- 
tained in the previous one and the length of the nth interval is 
one 2"-th part of the length of the original interval. This nest 
of intervals defines a real number €. It will be shown that é 
is a point of accumulation of the set. 

Consider any rational neighbourhood (r| s) of €, so thatr <<é<s. 
Then from a certain number n, onward we must have r< An, 
and from another (possibly different) number n, onward bn, <8. 
In any case, if n > n, and also m > ng, then (a,|b,) is contained 
in (r|s). The construction of our nest (a,,|6,) shows that each 
interval of the nest contains an infinity of points of the set, and 
therefore the arbitrary rational neighbourhood (r|s) of € also 
contains an infinity of points of the set. But this asserts precisely 
the fact that ¢ is a point of accumulation of the set. 
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4, Upper and Lower Points of Accumulation. Upper and Lower 
Limits. 


In the construction which has just led us to a point of accu- 
mulation of a bounded infinite set, we might have made the 
restriction that the second interval (that with the larger numbers 
as its end-points) should always be chosen whenever it contained 
an infinity of points of the set. If this were done, the nest of 
intervals obtained would define a perfectly definite point of 
accumulation B of the set. This number 8 is the greatest of 
the numbers corresponding to points of accumulation of the set. 

This follows at once from the remark that there can only be 
a finite number of points of the set in any interval to the “ right ” 
of each interval (a,|6,) of the nest described above. 

If yis an arbitrary number greater than f and if n is sufficiently 
large, the number b, is less than y. Only a finite number of 
members of the set can be greater than 6,. Thus y cannot be a 
point of accumulation, so that f is in fact the greatest number 
corresponding to a point of accumulation. It is called the upper 
limit (lim) of the set. 

If in the construction we agree to choose the first interval 
of the two (that with the smaller numbers as end-points) when- 
ever it contains an infinity of points of the set, we arrive in the 
same way at the lower limit (lim) of the set. 

The upper limit B and the lower limit a need not belong to 
the set. For example, in the case of the set of numbers a, = 1/n, 
Qo,-1 = (n— 1)/n, we have a= 0, B= 1, but the numbers 0 
and 1 are not members of the given set. 

In the example just given the set contains no number greater 
than 1. We say that in this case 1, besides being the upper limit, 
is the upper bound G of the set. The general definition is as 
follows: the number G is called the wpper bound of a set of numbers 
if the set contains no number greater than G, and, of every number 
less than G is exceeded by at least one number belonging to the set. 

It is important to notice the distinction between the upper 
limit and the upper bound of a set. Take, for example, the set of 
numbers 1, 4, 3,... . The upper bound is 1 and the upper limit 
is 0, the number 0 giving the only point of accumulation of the 
set. 

We shall now show that every set of numbers which is bounded 
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above has an upper bound. A set of numbers is said to be bounded 
above if there is a number M such that all members of the set 
are smaller than M. We first note that df the set contains a greatest 
member G, then G is the upper bound of the set. But a set which is 
bounded above need not have a greatest member, as is seen from 
the example (n — 1)/n, (n= 1, 2,...). We now assert that if the 
set has no greatest member, its upper limit is also its upper bound. 

For suppose the set contains a number z > 8. We consider 
all members of the set which are not less than x. There can only 
be a finite number of these, for otherwise the interval (a|J/) 
would contain an infinity of members of the set and thus at least 
one point of accumulation, contrary to the assumption that P 
is the upper limit. Among the finite number of members of the 
set which are not less than x there would be a greatest one, and 
this would at the same time be the greatest member of the 
whole set. Thus we should be thrown back on the case already 
dealt with. It follows that if the set contains no greatest member, 
then no member of the set exceeds the upper limit. The number 
B also fulfils the second condition that it should be the upper 
bound. For suppose that y is any number less than f; then the 
interval (y|M) is a neighbourhood of £. But since f is a point 
of accumulation the neighbourhood contains an infinity of points 
of the set, all greater than y. 

The lower bound g of a set of numbers is correspondingly defined 
as that number which is not greater than any member of the set, 
and which has the property that every number greater than g 
is also greater than at least one member of the set. Every set 
which is bounded below has a lower bound, which is either the 
least member of the set or else the lower limit of the set. 


5. Convergent Sequences. 


We consider sequences of numbers a,,a9,..., always assuming 
that they are bounded. The principle of the point of accumulation 
shows that the set of numbers a,, a, ... has at least one point 
of accumulation. A sequence of numbers is called convergent 
if it has only one point of accumulation a. This number a is 
then called the limit of the sequence, and we write 

lim a, = a. 


u—> oO 


The following definition is clearly equivalent to the one just given. 
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A sequence of numbers a, ag, ... has the limit a if, and only if, 
every neighbourhood of a. contains all the members a.,, of the sequence, 
with the possible exception of a finite number of members. 

For if the bounded sequence a, has only one point of accumu- 
lation a, then only a finite number of members can lie outside 
any neighbourhood of a; otherwise there would be some other 
point of accumulation. Conversely, if all neighbourhoods of a 
contain all the numbers a, with only a finite number of excep- 
tions, then the sequence a, is certainly bounded. It can only 
possess the one point of accumulation a. For if a’ were another, 
we could choose quite separate neighbourhoods of a and a’, 
and in each of these there would be an infinity of numbers 
belonging to the sequence. This would contradict the hypothesis 
that only a finite number of members of the sequence lie outside 
any neighbourhood of a. 


A sequence which does 
not possess a limit should 
not be regarded as any- 
thing abnormal. On the 
contrary, the existence of a 
limit is in a sense excep- 
tional, For example, the 
sequence whose members 
are doy, = 1/n, AQn_1 = 
(n—1)/n, n=1, 2... 
has two points of acoumu- 
lation, namely 0 and 1. 


ee 
e 


. 


RRO 
teutetes 
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ed 
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The aggregate of the positive rational numbers can be regarded 
as @ sequence of numbers, though we must first entirely dislocate 
the ordering by magnitude. The simplest way to arrive at such 
a sequence is to order the members by means of the array in fig. 1. 
The line drawn in the figure shows the order in which the numbers 
should be taken, any number which has already appeared in the 
sequence being disregarded. As has already been mentioned, the 
set of all rational numbers has every real number as a point of 
accumulation. 

The concept of convergence enables us to make a very useful 
deduction from the principle of the point of accumulation. Jf M 
is a given bounded infinite set of numbers with E as point of accumu- 
lation, then M contains an infinite sequence a1, a2, ... of numbers 
converging to the limit &. 

20 (2912) 
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To prove this we assume that € is given by a nest of intervals 
(a,,|b,) where ag << &<b,. Since € is a point of accumulation, 
(a,|6,) contains an infinity of points of M. We choose one of 
these and call it a,. Again, (a,|5,) contains points of M. We 
choose one of these and call it a,, and so on. The resulting sequence 
04, Gs, ... i8 bounded and can have no point of accumulation 
other than é. It therefore converges to the limit €. 

We now call attention to the two following theorems on con- 
vergent sequences, which, though simple, are important in what 
follows. 

If the sequence a, ag, ... converges to the limit a, then every 
infinite sub-sequence converges to a. For instance, a, ag, a5, ... 
converges to a. 

This follows immediately from the observation that any point 
of accumulation of a sub-sequence must be a point of accumulation 
of the original sequence. An infinite sub-sequence must have at 
least one point of accumulation, and this can only be a. 

Tf a4, ag, ... and By, Bo, ... are two sequences with the same 
limit y, then the mixed sequence a,, B,, 2, Ba, dg, . . . converges to y. 

Any neighbourhood of y contains all the numbers a,, and all 
the numbers f,, with the possible exception of a finite number 
of members of each sequence. It therefore contains all members 
of the mixed sequence, except possibly for a finite number of 
these. 


6. Bounded Monotonic Sequences. 


A sequence of numbers aj, as, . . . is said to be monotonic if 
either 
On S Onty 
for all values of n or 
On = Onty 


for all values of n. In the first case we say that the sequence is 
monotonic non-decreasing, and in the second that it is monotonic 
non-increasing. 

We now prove the important statement that every bounded 
monotonic sequence is convergent. We may restrict ourselves to 
the proof for the non-decreasing sequence. The other case is 
exactly similar. 

Since every bounded sequence has at least one point of 
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accumulation, we need only show that our monotonic sequence 
cannot possess more than one. Suppose, then, that there are two 
such, a and a’, say, and that a < a’. About a and a’ we construct 
two quite separate neighbourhoods U, and U,. Each must 
contain an infinity of members a, of the sequence. Take one 
of the members contained in U,,, say a,. Now let a, be the first 
member beyond a, which lies in U,. There must be such a 
member, since U, contains an infinity of members. Now all the 
members in U, are smaller than any in U,. It follows that 
a, > a,, which contradicts the hypothesis that the sequence is 
non-decreasing. 

We may add the following remark: if a,, a), ...i8 non- 
decreasing and bounded, then lim a, = ay for every N. For only 


n—>o 
a finite number of members a,, that is to say a,, dg, ..., Gy-1) 
can be less than ay. Therefore the limit is not less than a,. 
In the same way, we see that the limit of a non-increasing sequence 
is not greater than any member of the sequence. 


7. Cauchy’s Convergence Test for Sequences of Rational 
Numbers. 


Before we can lay the foundations of calculation with real 
numbers we need a convergence test which is not restricted to 
sequences of rational numbers; but we cannot formulate this 
until we have defined subtraction for real numbers. We shall 
therefore prove the convergence test for rational numbers here, 
and return to the general case in sub-section 9, p. 585. 

The test in question is as follows: 

A sequence of rational numbers a,, a, . . . 18 convergent of, and 
only if, corresponding to every posite number «, however small, 
we can find a number N(c) such that for every n > N andm >N 

| Gn — Om | <e. 

We shall first show that if this inequality is satisfied for all 
sufficiently large numbers m and n, then the sequence is con- 
vergent.* The boundedness of the sequence is proved as follows. 
We take the special value «= 1. Then for a sufficiently large 
value of ” and all sufficiently large values of m 

| @_— Om) <1. 


* Attention must be drawn to the fact that the elements of the sequence a,, Oy 
. are assumed to be rational, but that this is not the case with the limit a. 
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With a finite number of possible exceptions, then, all the 
numbers dy, lie in the interval (a, — 1|a,-+ 1). Thus a properly 
chosen interval will contain all the numbers @,, without exception. 
The principle of the point of accumulation shows that the sequence 
has at least one point of accumulation. We have still to show 
that there cannot be more than one. Suppose there are two, 
a and a’. About a and a’ we could construct quite separate 
neighbourhoods (c| d) and (c’| d@’) so that 


cca<d<d<d<d, 


where we assume, as we may without restriction of generality, 
that a<a’. Since a and a’ are assumed to be points of accu- 
mulation, (c|d) contains an infinite number of points a, and 
(c’|d’) contains an infinite number of points a,,. Thus, in par- 
ticular, for an infinite set of values of n and m we have 


Om —- A, ze —d> 0. 


But this contradicts the hypothesis, which shows that for all 
sufficiently large values of m and 


| Gm — On{<e' —d. 


Hence the sequence has one, and only one, point of 
accumulation. 

We next show that if the sequence a,, a, .. . converges to a, 
then for every «> 0 and for all sufficiently large values of n 
and m 


| @n— Om |< €. 


We take a neighbourhood (c| d) of a, whose length (d — c) is less 
than or equal to «. If N is suitably chosen, then whenever n 
exceeds N, ad, lies in (c|d). Thus if n> N and m>N, both 
a, and @,, lie in (c|d). From this it follows that 


| an — @,|<d—ceSe. 


8. Calculation with Real Numbers. 


So far our work has given us the definition of real numbers 
by means of nests of intervals, and their ordering by magnitude. 
The theorem last proved provides a simple means of defining the 
rules of arithmetical calculation with real numbers. 


RULES OF CALCULATION : 581 


Let a real number a be given by a nest of intervals (a,|5,). 
Since the intervals form a nest, the numbers a, form a monotonic 
non-decreasing sequence and the numbers b,, a monotonic non- 
increasing sequence. These sequences are bounded; for we may 
note that every a, is less than or equal to b,, and every 5, greater 
than or equal to a,. The sequences therefore converge. In both 
cases, moreover, the limit is the real number a. For every neigh- 
bourhood of « contains all the intervals (a,,|b,), except possibly 
for a finite number of these, and thus the neighbourhood contains 
all but a finite number of members of the a,, and the b,, sequences. 
We may therefore say that every real number can be exhibited as 
the limit of sequences of rational numbers. 

If now we wish to define any operation of arithmetic for two 
real numbers a and £, we choose two sequences a, and b, of 
rational numbers with the limits a and f respectively. We 
perform the operation on the pairs of numbers a, and b,, and thus 
obtain a new sequence. When we have proved that this sequence 
has a limit, we shall say, by way of definition, that it is the result 
of the operation on the two real numbers a and f. 

Let a and £ be two arbitrary real numbers and let lim a, = a 


n> 0 
and lim 6, = 8. We consider the sequences @y-+ by, Gn — bn, 


n—> 0 
a,b,, and 1/a,. If we can prove that these sequences converge, 
we can set up the definitions 


a+ p= Ho Gar by), 
a— B= lim (a, — 6,), 


aB — Pep nba), 


in (2), 


The convergence of these sequences will be proved by means of 
Cauchy’s convergence test. 

Tt follows from the convergence of a1, a, . . . that if ¢ is a given 
positive number and » and m are sufficiently large, say n > N, 
and m > N,, then 


| @y — Om | < €/2, 
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and, from the convergence of 6,, by, ... , that if m and m are 
sufficiently large, say n > N, and m > Ng, then 
| Bn — Om | < €/2. 


If N(e) denotes the larger of the two numbers N, and Ng, then 
if n> N(c), m> N(e), 


| (n+ Sn) — (m+n) | S| a— Am | + | On— bn <S+5=€ 
and 


| (42—82)— (an — Bn) | S| An— Om | + | ba—bm| <5 += € 


By Cauchy’s test both the sequences a, + 6, and a, — b, 
converge. 

To prove that a,b, converges, we must first notice that the 
numbers a, and 6, form bounded sets. There are therefore two 
positive rational numbers A and B such that for all values of n 


ja,)| SA, |b,) SB. 
Now 
| OnDn = OmPrn | = | An(bn —_ Bm) + Din(On —~ Gm) | 
S| aq | | On — Om | + [Om | | tn — Om | 
SA|ba— On| + Bl Gn — Om |. 
Since the sequences aj, a,,... and bj, by, ... converge, we can 
find numbers N, and N, corresponding to any given « > 0, such 


that 
| dy — Om |< ¢/2B when n>N,andm>N, 


and 
|b, — bn |<<«/24 when n> WN, andm> Ng, 


Thus if m and m are both greater than the larger of the two 
numbers N, and N,, the above inequalities hold simultaneously. 
We have therefore 


— Amd A+B =e. 
| a,b, Gy! m| < sa 2B € 


Cauchy’s test shows at once that the sequence a,b, is convergent. 
We now suppose that a +0 and lima,=—a. We have to 


n—> 2 
show that 1/a, converges. It is first necessary to show that 
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if n is sufficiently large, |a,| is greater than a positive number p 
independent of n. We take a rational neighbourhood of a which 
does not contain 0. This is possible, since a + 0. From a suitable 
n == Mm onward all the members of the sequence a,, a, ... lie 
in this neighbourhood. This shows that, for n > no, | a,{[= p, 
where p is the absolute value corresponding to the end-point of 
the interval that is nearer to 0. The convergence of the sequence 


—, —,... is not affected by the omission of the first n) members, 


and we may therefore now assume that for all values of n 


[a,{2p> 0. 
We observe that 
dal | em — tn | | am — On| < | m— an) 
an an | Gnd | | an | | Gm | pe 


Let € > 0 be given. If N is suitably chosen, then, since aj, ay, .. 
converges, % > N and m > N give 
| Gm — On| < ep’, 
so that 
1 1 


an Om 


<P ue, 
P 
This proves the convergence of 1/a,, provided that a = 0. 

It is obvious that any real number may be exhibited as the 
limit of more than one sequence of rational numbers. It might 
be thought that the definitions given above do not define the 
arithmetical operations uniquely. For instance, suppose that 

lim a,= a and lim b,= f give one representation of the numbers 


n oO n—> 20 
and B and lim a,’=a and lim b,’= B another. Then possibly 


n—>o n—>o 
the two sequences a, + b, and a,'-+ 6,’ might have different 
limits. (We have proved that they do have limits.) We shall 
now prove that this difficulty does not arise. It will be shown 
that if 


lim a, = lima,’ and lim 8, = lim 6,’, 
n—> ao n—> o u—>o n—>o 
then 
lim (a, + b,) = See 1 (An! + 6,’); 
a> wo 
lim (a,0,) = im. \ (an bn s 
a> 2 
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u—>o 


The proof is very simple. It has already been shown that if 


lim a,= lim a,’ = a, then the mixed sequence ay, ay’, @q, de)... 
n-> © n—>o 
has the limit o. In the same way, we see that b,, b,’, by, be’, . . 


converges to B = lim b, = lim 8,’. From this and the above 


n—> oO N—> 0 
theorems we find that the mixed sequences a, + b,, a,’ + ,’,..., 
and a,b,, a,b)’, ... and, if a = 0, * = ... are convergent. 
It has already been proved that every sub-sequence of a given 
convergent sequence converges to the same limit. From this it 
follows that the sequences 


a, +0, dy +b... and a,’ +b,’, ay + by',..., 


which are sub-sequences of a convergent sequence, must converge 
to the same limit. In the same way, 


4b, Gaba, ... and a,’b,’, de’b,’, ... 
have the same limit, and the same is true of 
Late ee and a : ek 
Gy a 


The results just obtained allow us to settle another important 
question which is connected with our definitions of the operations 
of arithmetic. 

The class of real numbers contains the rational numbers. In 
the course of our definitions of operations on the real numbers 
we have thus incidentally defined these operations for the rational 
numbers, But we began by taking the operations on rational 
numbers as known. We must therefore verify that the new 
definitions do not give rise to any contradiction in the case of 
the rational numbers. What we have to show is that if lim ¢,—a 
and lim b, = 6 are rational numbers, then aad 

n—>o 

lim (a, + 6,)=a+.b, 
n> wo 
lim (a,},,) = ab 


ao 


RULES OF CALCULATION 585 
and, if a + 0, 


It should first be noticed that a rational number a is the 
limit of the rational sequence a,a,.... For the two sequences 
ay’, dy’, ... and b,’, b,’, ... we may take the special sequences 
a,a,...and 6, b,... . The above theorems then yield 


lim (2, + b,) = lim (a2 + |) =a t+), 
n—>oa n—> ao 


lim (a,b,) = lim (ab) = ab, 
n—>o n—> oO 


lim + = jim 2) 
n—>o ly, naa a 
which is the required result. 

It need hardly be mentioned that, as a result of our definitions, 
all the rules of calculation that hold for rational numbers also 
hold for all real numbers. We have only to apply the rules to the 
rational numbers forming the sequences. Let us, for instance, 
prove the distributive law, a(8 + y) = aB + ay. 

Let a = aa L Om p= ar 1 De y= bie 1 ons Then the left-hand 


side of the "equality to be proved i raga {an(bn + ¢,)}, and the 
n—>o 
right-hand side is ia 1 (nbn + a,¢,). But since the distributive 


law is true for the. rational numbers, the two sequences are 
the same, and this must also be true of their limits. 


9. The General Form of Cauchy’s Convergence Test. 


We return to Cauchy’s convergence test, which we have 
already proved for rational sequences on p. 579. Now that the 
operations of arithmetic, in particular subtraction, have been 
established for real numbers, we can formulate the convergence 
test quite generally for real numbers. The sequence a4, ag, ... is 
convergent if, and only if, for any given « > 0, we can find an 
suffic N(e) such that whenever m and n are both greater than N(e) 


| Gn — Gm | <e. 


The proof is exactly like that given on p. 579, and need not 
be repeated. 
20° (912) 
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The following point is of great theoretical importance. Cauchy’s 
convergence test contains in its enunciation a means of estimation 
of error. For if we are given the sequence and know the number 
N(e), we can state at once that the limit of the sequence lies 
between the numbers a, + ¢ and a, — e whenever n > N(e). 

In this respect Cauchy’s test differs from the test for mono- 
tonic sequences. The latter proves the existence of the limit, 
but it gives no means of estimating the limit. Thus in proofs 
of convergence which depend on this test any estimation of the 
limit (and theoretically it is always necessary to give one) must 
depend on separate and extraneous considerations, 
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6. There exists an ¢ > 0 such that for every A there is an A’ > A 
such that 


ve a] 
(x, y) dy 
A 


tor some value of z. 


Appendix, § 6, p. 338. 
1, Substitute 2” = aE, y™ = b™y, 
3. Integrate first with respect to y and z: 
I'(2n)1(3n) 
= 3 -~____—, 
7 4? Din)T(4n) 


4. 2R4B(3, A) = aR 


5. Show that 
‘Se 2, (ays 
Oan(22) = 32°90, (2)Gq(% + 8) Sencana 5 


then let n —-> © and apply Wallis’s formula (Vol. I, p. 225), 


CHAPTER V 
§1, p. 359. 
1, eé sin». 
2. Letu=. il (a cosy + y siny) 
. ary y+ysiny), 
v= aay © 5 (—esiny + y cosy), 


d let u, and v, be defined by the equations 
+ % 


eee appt eae 


Then wu, and », are twice continuously differentiable (and that at the 
origin also), and (1)y = (%),- Hence f u,dx + v,dy = 0 and 
oO 


= $de = = dy — —4 da =2n, 
[emmttn lap ar ge 


by the footnote on p. 359, 


640 ANSWERS AND HINTS 
§ 5, p. 392, 
1. (a) Cf. Ex. 3, p. 37. 


(c) Let 2 be an arbitrary region and » an arbitrary function vanishing 


on the boundary of R. Then by Green’s first formula 


f f (Ug,%, “Ue .Pog + UsrgYarg) O01 Tite ity 
2 


= — ff feArude,de,de, = — ff [rduveetcdr dpsdpy 
z B 


Now 
ep. ap, ap, 
tha, = Moy 3 + “00 ae, * “05 bn, 
= i U2 8 
= Up, — + Up, Cy + Dg 3 
and 
a; % Gs 
ta OS oy 
Hence 


Hi i (Moy? Ft tg,0ag + tess ing) Ary dite dry 
=/S{G« p,m, + 6 Mota, + x You det, dat, dary 


, | 2% [2sex eye 
~fff( e Up, %m, + “ey Uy Yo + V2 9,5) dp, dp, dp, 


=ff fo Up, + Ver, + Usp.) 4p dp, Apo, 


Ve, eye , 


where we write U, = D,° 


& 


Applying Gauss’s theorem to the vector (U,», U,v, U,v), we obtain 


ov. ou aU 
-fff Bp, a+ ont a) vdp, dp, dps. 
pt 


Thus for an arbitrary v vanishing on the boundary of R we have 


fff foauveree ap, dp, dp, 
= ff fo(@ 4844 2 ond 
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and hence (cf. lemma I, p. 499). 
oe ( aU, , 2Us 


1 
Os oe ot) 
ap, OD, Ops C1020 


- el tfE DAES) (Ea) 
(d) Use Ex. 6(¢), p. 158. 
H(ty — 4) te — ty) ta — te) Au = (ty — 2) (6) x (Vv (4) x) 
(pS LV 94) 5 (V= 9%) mn) + (t.— 1) Votes g (Vote) mp 


where 9(x) = (a — a)(b — x)(c — 2). 


§ 7, p. 401. 


i ffia= (at gt ays J frtnavee 


where the volume integral is to be extended throughout the upper half of 

the ellipsoid. (The base of this half-ellipsoid contributes nothing to the 
rfl 1 2 

urface integral): — (- —+-, ) 2, 

surface integral) 4 ae at zs abe’ 


2. Since H is a homogeneous function of the fourth degree, we have 


af fads =f fet, + yHy + 2H)d8 
= [Raf f fanaeayae 


=f f [ter eat agtay) + yay t at a5) + Ata + ao + ae) dedy de 
4n 
B (ti + te + as + ty + og + a). 


Appendix, § 2, p. 406. 

1. The two equations u = f,, v = f, can be solved for x and y, since 
a(u, v) 
az, y) 
Ly = Yy C= Ty. Hence a function g exists such that a= g,(u, v), 
¥ = Gy(u, v). 
ip a ig ee 

(a? + y?) V (a? + y? + 2?) (@+P)V(e+ + 2) 


+0. Leta= o(u, v), y= t(u, v); since u,—= vy, we have (cf. p. 143) 


7) (B912) 
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Miscellaneous Examples V, p. 407. 


2. If (& n) and (2, y) are rectangular co-ordinates in II and P respec- 
tively, then the motion of the point M(x, y) can be described by the 
equations § = xcosp — ysing + 4, y= zsing + y cosp + b (ie. by a 
rotation and a translation). Then 


S(M) = A(x? + y*) + Bu + Cy + D. 


(x) If A= mn + 0, we have 8&(M) = na[(x — a)* + (y — yo)*] + S(O), 
where C is the point x = x = —B/2nn, y = yy = —C/2nz, hence A, B, 
C, D have the values in Ex. 1. (8,) If A = nx = 0, but B? + C? > 0, then 
Ba + Cy + D 
VBE 
where 4 = \/ B? + C? and A is the line Br+Cy+ D=0. (8,) If, 
A= B=C=0, we have S(M)= D = constant. ; 


Sy=VB+C = Ad(M), 


3. For the motion of the plane P rigidly attached to the connecting- 
rod AB we have n = 0, S(A) = 0, S(B) = nCB* = my*®. Hence A passes 
through A, and by symmetry A is perpendicular to AB at A. Hence 
S(M) = wy*t1d(M), where 1 = AB. 

4. For the motion of the plane P rigidly attached to the chord AB we 
have n= 1, S(A) = S(B) = S= area of IT. The point C of Steiner’s 
theorem is therefore equidistant from A and B and S(A) = nC'A? + S(C), 
S(M) = nCM? + S(C), hence S(A) — S(M) = area of T — area of I” = 
n(CA2 — CM?) = rab. 


5. Ifd is the length of I’, the Frenet formule (p. 94) give 
nw 2 Ee _ : = Px aa 
[Pasa [¥a0= feas= sa = 05 
o t . 
f er ge — if [28]d6 = [x84] — f [x8,]ds 
= — [leds = 0 (ct. p. 85). 


6. Let #2’ = (a, B, y), x = (2, y, 2). If in Gauss’s formula 


Jf font e+ enae~—f f [E+ F + 2)aeayas 


we substitute a = 1,b =c= 0, anda = 0, b= —2z,c= y, we get 


f i ado=0O and d, Hi (yy — z8)do = 0 respectively. 


7. Take rectangular co-ordinates (x, y, z) such that z = 0 is the free 
horizontal surface of the fluid and Oz points downwards. The pressure 
on do is nzdo, where z is the depth of do. By repeated applications of 
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Gauss’s formula in three dimensions, with obvious choices of the functions 
a, 6, ¢, we find for the components of the resultant of the fluid pressure 


f [rede =o, f [eade=0, f fredo=—f faeayde= —V. 


For the components of the resultant moment with respect to the origin oO 
we find, again by Gauss’s formula, 


f for — 28B)do =f f [yanayae = Vy, f foe — xzy)do = 
— ff fededyde = —Viy, f free — yza)do = 0, 


(os Yor 2 are the co-ordinates of the centroid C). 


Now we note that the components of the force f are 0, 0, —V, and the 
components of its moment with respect to O are Vy, —Vaq, 0. 


8. From the parametric equations 


w= a cosu cosy, = bsinucosv, z=csiny 
bd 
(0su<2n, —Fsv<f) 


of the ellipsoid we readily obtain the formule 


pdS = abc cosy dudv, dS/p = D*dudv/(abc cosy), 
where 


D* = 0% cos*u cos?v + a%c? sin? u cos?v + ab? sin? v cos*y, 
10. The integral represents the flat solid angle which the plane z= 0 


subtends at the point M = (0, 0, 1). For a direct analytical proof use 
plane polar co-ordinates. 


12. Verify the identity 


a) cee? =) =0, y2=(e—a)?+ (y— 5)? + (2—e)%, 


for all points (x, y, z) different from (a, 6, ¢). From Gauss’s formula in three 
dimensions we conclude (i) that Q = 0 if X is a closed surface such that 
A = (a, b, c) is outside the volume bounded by x; (ii) that if A is within 
Z, the value of the integral is independent of the shape of &. Taking 
for 5 a sphere with centre A, we easily seo that © = 4m. 


13. The integral 


6Q 2 (45*) eC) =a (a 
—= _ dy d — (——— ) ded — (——— 
éa i ad on + 5a Pad a da\ 8 aedy 


is independent of = and depends only on the boundary I of %, for the 
identity given in the answer to Ex. 12 implies that 


21eCs + SLeCH))+ aL 
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By Stokes’s theorem and the discussion of Chap. V, Appendix, § 2 (pp. 393, 
404), the surface-integral expression for 20./@a may be expressed as a line 
integral Juda + vdy + wdz along I’. Verify that the functions 


satisfy the identities 
a es ou ow _ 8 fb—y fo _ Gu _ 2 (= *) 
dy & a\ 7 


a ox ba yo J ax dy oa 
14. Note the following facts: (1) the value of the line-integral © 
remains unchanged if I is deformed in such a way that I’ never sweeps 
over any of the points (—1, 0) or (1, 0) duriny its deformation; (2) @ = 2x 
if [ is a small circle around (1, 0) oriented counter-clockwise; (3) @ == 27 
if I is a small circle around (—1, 0) oriented clockwise. 


7 


15. Think of C as being a rigid circle made of wire and of I’ as being 
a string. Now deform the string I to a new position IY lying entirely 
within the plane y = 0. The numbers p and are not changed during this 
deformation, and the first formula now follows directly if Ex. 14 is applied 
to the curve I” within the plane y = 0 and the line-segment —1 < # < 1, 
y = 0,z = Oof this plane. The factor 4x (instead of 27, as in the previous 
example) is due to the fact that the solid angle Q increases by 47 along a 
closed path for which p = 1, n = 0. : 

One way of carrying out the above deformation of I' into I’ analyti- 
cally is as follows. Assume that I does not meet the z-axis and let 


z= y(t) cosg(t), y= y(t) sing(t), z= 2(t)(0 St S 2n) 
be the parametric equations of I, Consider now the family of curves 
I(t): @ = y(t) cos[zo(t)], y= y(é) sin[t9(t)], z= 2(t) 


depending on the parameter + which decreases from t= 1 to += 0. 
Note that P(1) = I’ and that IY = I(0) is a closed curve which lies in the 
plane y = 0, Note also that (for a fixed value of z) each point P of I(r) 
rotates about the z-axis as + varies; hence the solid angle Q which C sub- 
tends at P does not vary with t. This implies that Q, — Q, will have 
the same value for I(0) as for I'(1) =. To prove the second formula, 
note that 

[PP’ 4P’] 

Q, Oy = faa [eda.ap fer-f EPR 


. f oe) f j peas 
Soda | PP} rJp [PPP 
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CHAPTER VI 
§ 2, p. 428. 


1. Use the theorem of the conservation of energy, and prove that 
r—>o asi-> om. 


2. If (&, 1) are the co-ordinates with respect to the axes of the ellipse, 
then 
—= acoso = «+ oa 
n= bsino = y 


give the equation of the ellipse; and by the law of areas 


= ab f° — ¢cosw)de. 
0 


[Note that the question ought to read: “, .. the angle P’MP,, 
where P’ is the point on the auxiliary circle corresponding to P, the 
position of the planet . . Pages | 


3, 4. Use the theorem of the conservation of energy and the law of 
areas. 
§ 3, p. 432. 
1. (a) y = tan loge/V(1+ 2%). (b) y=ov(L + e**), 
2. (a) y= ce, (6) y®(2a* + y?) = 0% 
(c) x* — 2cx + y* = 0 (circles). 
(d) are tan(y/x) + ¢ = logV(x*+ y*), or, in polar co-ordinates, 
r= ef+¢ (logarithmic spirals). 
(e) ¢ + log | #| = arc sin(y/x) — : Vv (a? — y"). 
3. If ab, — a,b + 0, we have 
dy oth _ ot boln/e) 
de ay ft by! = + by 9(n/ ey’ 


which is a homogeneous equation. 
If ab, — a,b = 0 or a,/a = b,/b = k, then 


=a oy es ised 
1 a4 dtm at bore) 


and the variables are separated. 


4. (a) 4a + 8y + 5 = cete—8u, 
(b) = ¢ — 3(3y — Tx) — £ log (3y — 72). 
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5. (a) y= ce8iD T+ sinz— 1. (b) y= (x + 1)"(e* + e). 
eee ye ee ai a 


y= aa Ome, 
Wen eee 
6. Introduce 1/y as new unknown function; the equation then be- 
comes homogeneous: 
1— crv 
ceV5 (§ — §V5)—4—4V5 


i= 


§ 4, p. 444. 

1. Use induction. Suppose that a linear relation c,p, + ... + ¢,9,2=0 
holds. Divide by e%* and differentiate (n, + 1) times, if P,,(x) is of degree 
n,. The degree of the coefficients of the other e%:*’s is unchanged, so that 
they remain different from zero. 


2. Multiply both sides of the equation by (1 — n)y~. 


(a) y=cx+ logz +1; (0) y® = ca-3 4 a 


(c) (y+ 4)? = (2? — I). 
3. If we put y = y, + u, the equation reduces to the linear equation 
wu — (2Py, + Q)u = P. 


er" 


ey aa 
e+ f atet@ dar 
0 


4, By equating the right-hand sides of (a) and (b), we obtain the 
common integral y = 2%. 


te 
I 
R 
| 


5. y= 8 — (= Sle) 


To draw the graphs of the corresponding family of curves, first plot the 
two branches of the curve 
y+ee—at@=0 y= +V(e— 2)2), 


which divides the plane into two regions where y’ < 0 and one region 
where y’ > 0. The two infinite branches of this curve are asymptotic to 
the two parabolas y = +2%. Show that all the integral curves are asymp- 
totic to these parabolas by proving the two relations 


f(z c) = —22+ 011) as t>+u(—M<e< w}) 


and 
f(z, c)= 22+ 0(1) as r—>— w (c+ 0), 


where 0(1) denotes a function which tends to zero, 
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6. Put 
Ww—-w=% A-MU=1 W-w=G BW-M=H 
Then 
a’ + Pay, + ys) + Ga = 0, 
so that 
A 
Py, + ¥3) = —@—<, 
Ply, — Ys) = aP, 
or 
Pye Ox. 
a 
Similarly, 
2Py, = bP —Q— > 
Hence 
dlog(a/b 
toe (@/P) _ Pad) = —Pls— va 
and similarly, 


Towel) = —Plys az Ya); 


lo, (ee 5) = const 
24 b ° d 


7. Cf. the relation 
dlog(a/b) _ Plo 
aa (Ys — Ys)s 


in the proof of the preceding example. 1 
Particular solutions of the special equation are y,=-—— and 
1 1+ cet cos” 


Ya coon (1 — ce**) cosa 


8. The common solution e* of (a) and (b) is obtained by eliminating 
y” from the two equations. 


(a) 0,6” + Cet; 
(db) cye® + Ve 


§ 4, p. 449. 


1. From the fundamental theorem of algebra it follows that f(z) may 
be written f(z) = (2 — a,)"(2 — aa)h2.. . (@ — &)Me (cf. Vol. I, p. 230); 
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where the u,’s are positive integers such that u,+...+4,— 2; and 
F(4,) = f(a) = 66 = fH—V(a,) = 0, 


Now 
L(ed®) = f(rjed*, 
On differentiating this relation (u,— 1) times and putting A= a, in 
the result, we get (cf. Leibnitz’s rule, Vol. I, p. 202) 
L(e%s*) = f(a,)e%* = 0 
L(xe%™) = [f’(a,) + af (a,)Je%* = 0 
Ley) = usa) + 2uef'(ay) + 2*f(e) Jer” = 0 


eo © © © «@ o © «© 6 7 8 8 


L(coy~1 ey) = (ee 1 f%-1a,) + (t=) flea, a 


eke (> - iD 1(@,)e%-*| ot,® = 0, 
Wy — 1 
So we have m particular solutions 
eMh, geh® ath lat 
eM, e® safe leet 
EX, eh |. , phe Nee, 
which are linearly independent, by Ex. 1, p. 444. 


Vv 
2. (a) y= ce* + cet * eos + ce? cin, 
(b) y == ye” + come” + coe”, 
(c) y = ce” + cnre® +. cate, 
(d) y = cye* + ce + cgeV 2 + cye~V 22, 
(e) Substitute x = e’: 
Y = Oe + cof. 
3. On substituting in the differential equation, we get 
(Goby — 1) P(x) + (gb, + @1B9)P’(x) + (dob + 4b; + Agbo) P(x) +... == 0, 


and this is an identity if ayby= 1, agb, + 2,0) 0, ..., from the expansion. 
The second case reduces to the first if we substitute y’ for y. 


4. (a) =1—#-++4—...; hence y= P(x) — P’'(z) = 3x3 — 52— 6. 


ey, 
Oe ae 3 hence 
ipa ones 


y = f P(x)dx — P(x) + P(x) — P(x) = —$ +4 2+ fea, 
5. (a) y= ge%, (b) y = Gere”. 
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we 3 7 
a es = = 8 250. 
6. ya e(S +cat i) tae + ae e 


§ 5, p. 405. 
1. (a) Use the fact that the curvilinear integral 


i (Be* + Gay*) dar + (Bx°y + 4y°)dy 


is independent of the path. Integrating between (0, 0) and (x, y) along the 
broken line (0, 0)... (a, 0)... (a, y), we get 


f “Ge + Gay?\dx + (Onty + 4y°)dy = a? + Baty? + yt =o. 
(6) By inspection we find the general integral 
V(1 + a + y*) — are tan(y/x) = ¢. 
2. Here dy/dz is a function of y/x alone. 
3. a2y — Qay* — Icy — 2 = 0 (integrating factor u = 1/y*). 
4. The equation is linear in # and its general integral is (wy? + 1)? = cy. 
The identity 


3 1)? 2 1 
a( a) = a [2y%dx + (3xy* — 1)dy] 


displays an integrating factor of the equation. 


5, (a) + y%@+ ce+ 1=0(—% <e¢ < o) and the line #= 0. 
(b) a? + Qy? =o 
(c) The differential equation of this family of confocal conics (cf. p. 


158) is found to be 
2 — 4? — a? +. Bb 
Fy aes Aa 
ay 


which is unaltered if y’ is replaced by —I1/y’; the family of ellipses 

(—b? < ¢ < o) is orthogonal to the family of hyperbolas (—a? < ¢ < —b*). 
(d) y = log | tan(z/2) | + ¢ and the vertical lines 2 = kr (k an integer). 
(e) The family of curves (tractrix) 


2—o= +(V (a? — y*) — a ar cosh(a/y)) 


y—1=90, 


and the same family reflected in the x-axis. 


6. (a) The family of parabolas y = cz’, 
(b) The family of hyperbolas xy = e. 


7. (a)y=a', (b+) y= —a+ elog(—z) (0>4> —o), 


8. y= ep + aV1-+ p — aparsinhp. 
22% (8912) 
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9.2=coP4+ hy 
y= op + ale? + tn(p + a) — 4 (p+ a). 


2 
Note that for c= 0 this gives the parabola y = 2? — + What is the 
geometrical meaning of this result? 


10. (a) y = sin(x + ¢), singular solutions y = + 1, 
(b) = + Haresiny + yV1i— y*)+e. 


(ec) ¢= F V Ga = py — 2a are tan gf) + 6 


which is a family of cycloids and can be expressed in the parametric form 
a=c-+a(e— sing), y= a(l—cosg). Singular solution y = 2a, 


vy At+y# ‘ : 
(d) w= +f i-y dy+e (—1lSyS1);_ singular solutions 
f = 
y= +1. (The reader should prove that these curves are not sine curves.) 


are eee 72)§ 
1. uN=yVity uo=—EtY” ond the differential 
equation is 


(1+ yy + ky” = 0. 
By the general method this is easily reduced to 


(ay 


de = (¢ an arbitrary constant). 


The various cases, all of importance in the differential geometry of sur- 
faces,* are as follows: 

(1) k= K(> 0), c= —y*(< 0, y? << «*). The curve is everywhere 
smooth, and oscillates, alternately touching the lines y= $k? — y?. 
It looks like a sine curve, but is not one. 

(2) k= «?,¢=0. The curve isa circle of radius « with centre on the 
a-axis, 

(3) k= x, e= y(> 0). The curve consists of a sequence of iden- 
tical arcs, joined by cusps lying on the line y = y, and all touched by 
y=~+/e? + 7%. It looks like a cycloid, but is not one. 


(4) k= —K%(< 0), c= y2 > x. The curve consists of a sequence of 
identical arcs upside-down, with their cusps on y= y and touched by 
y=VP-# 

(5) k= —k*, c= y? = x*. The curve is a tractrix. 


(6) k= —k*®, c= y? <«*. The curve has an infinity of cusps, 
perpendicular to the lines y = y and y = —y alternatively. 


12. Eliminate 6 from the equations obtained by differentiating the 
equation of the circle twice and thrice: (1+ y’”)y’” — 3y’y’2 = 0. 


*See Eisenhart, Differential Geometry, pp. 270-4 (Princeton Press). 
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13. y= xsinaz; singular solutions y= 2 and y= —z. 


14. If y(x) = Xe,2”, then 


Cy 
et Se =1,¢=0; 
6,42 +2) and ¢ cy = 0; 

S (—1y Y 2 

y(2) = Bye * 


If we substitute the power series for cosxt in the expression for J,(x) in 
Ex. 4, p. 223, and interchange summation and integration (why is this 
permissible?), we get 


: aia +1 Fea 
We= 5 Zon UL, vasa 
the value of fC dt is fad) oka pL as is found by putting ¢ = si 
‘Ts 2” y putting ¢ = sint 
and referring to Vol. 1, p. 223. The power series for y(x) and J,(z) are 


therefore identical. 
§ 6, p. 481. 
1. Poisson’s formula gives a potential function u(r, @) inside the unit 
1 
circle, with boundary values f(6). Now u(Z, 6) is also a potential function 


(cf. Vol. I, p. 479, Ex. 3) with the same boundary values, and it is bounded 
in the region outside the unit circle; thus the expression 


ee a da 
1 = 2r cos(0 — a) + 19 
is a solution of the problem. 
2. The potential is 
a+l+ Vets tt ¥ 
elt Vet ete 


Since on the ellipsoid z= la cosp, V2*+ y= IV oF —1 sing, the 
potential is 


the confocal ellipsoids 
2 oy? 
as neta ee | lee 
Pat" (2 — 1) Cat 2) 
are equipotential surfaces. The lines of force are the orthogonal trajectories 
and hence (cf. Ex. 5¢, p. 466) are the confocal hyperbolas given by the same 
equation when 0 Sa 51 and the ratio of x to y is constant. 


652 ANSWERS AND HINTS 
3. Let = be a sphere of radius p and centre (x, y, z), lying inside 8. 


. 1 
Since A () == 0 and Au = 0 in the region bounded by & and S, by Green’s 
theorem (cf. p. 390) we have 


om f [EEA ee— f [Cate 


where in the first integral » is the outward normal to S and in the second 


the outward normal to =. Now on the sphere X we have ee _ “oe 


1 
=— -—, r= const. = 9; therefore 


5 fen kf (Sane 


since u is a harmonic function (cf. p. 475); in addition, 


1 a(yijr), 
~ al fe a do= 75 | [udo, 


and as p —> 0 this expression obviously tends to u(z, y, z), for it is the mean 
value of uw on X. 


§ 7, p. 489. 


1. @) u= f(x) + gy) (f and g are arbitrary functions). 
(b) w= f(a, y) + o(%, 2) + Aly, 2) (f, g, h are arbitrary functions). 


(c) The most general solution is obtained from a particular solution 
by adding the general solution of the homogeneous equation w,, == 0. 


zy 
o= f dé f aE, n)dn + f(x) + gly), where f and g are arbitrary. 
0 0 


2. Apply the linear transformation 


w= E+ y 
y = 36 + 2. 


w= fly — 20) + gle — 9) + et, 
3. (22+ 272+ 1) = 1. 
4. u(x, t) = f(x — at) + g(a + at); then for z 20 


0 = u(x, 0) = f(x) + g(x) 
0 = u,(z, 0) = —af’(x) + ag’(x); 


by differentiating the first equation and comparing with the second, we 


have 
f(x) = 0, g(x) = 0, 


MISCELLANEOUS EXAMPLES 


1. Two vectors x, y (or three vectors x, y, z) are said to be linearly 
independent if a linear relation 


ax + by =0 (or ax + by + cz = 0) 


is possible only when a = b= 0 (or a= b=c=0). They are said to 
be linearly dependent if such relations exist without all the coefficients 
vanishing. Prove the following statements: 


(a) Three vectors x, y, 2 such that any two of them are ortiouoba’ 
to one another are linearly independent. 
(6) The vectors x, y (or x, y, 2) are linearly independent if, and only if, 


[xy] + 0 
% Fy 
(or x[yz]=|4%1 Ye Ys} + 0). 
% 2 


(c) If two vectors x, y in a plane are linearly independent, then any 
vector v in their plane may be written in the form v = ax + by. Similarly, 
if x, y, z are linearly independent, then any vector v may be written in 
the form v= ax + by + cz. 


2. We know already that if x, y, z are three vectors, 


% % Xs 
x[yz] = xzx])= 2[xy]=|41 ¥2 Ys 
my % 2% 


(the common scalar value of these expressions may be conveniently denoted 
by (x, », 2)). Prove the further vectorial identities 

xx’ xy x2 
yx’ yy’ 2’). 
zx’ zy ze’ 
(6) [xy ][x’y’] = (xx’\( yy’) — (x9’)yx’) (cf. Ex. 5, p. 19). 
(c) [xfy2]] = (xz)y — (xyz. 

(2) ([x(y2]], [fz], [2lx]]) = 0. 


Use the last result to deduce that if a plane is drawn through each of 
three concurrent straight lines perpendicular to the plane of the other two, 
the three planes thus obtained meet in a straight line. 
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(a) (x, ¥, 2x’, 9, 2’)= 


588 MISCELLANEOUS EXAMPLES 


3. Let Ox, Oy be a system of rectangular axes ina plane. Let Oz’, Oy’ 
be a second such system and let the angle zOz’ be 9. Prove that the 
passage from one system of co-ordinates to the other is given by the for- 
mulz 

a= 2’ coso—y sing, 2 = xcosp-+ ysing, 
y= 2 sinp+ycosp, y= —xsing + y cosg. 

4. From the result of Ex. 3 deduce the addition formuls 
coa(o + p) = cosg cosp— sing sin}, sin(p+ y)—sing cosy + cosg sin}. 


5. Let Ox, Oy, Oz and Oz’, Oy’, Oz’ be two co-ordinate systems, both 
having the same orientation, the cosines of the various angles being indi- 
cated by the following scheme: 


y v 


a’ 

m& Pr V1 
% Be Ye 
% Bs Ys 


nee 


In Ex. 1, p. 12, and Ex. 9, p. 38, the relations 


a? + 82+ y2= 1, Oty + Bobs + Yo¥s = 9, 
7 + Bo? + y.? = I, get, + Bsh + Yav1 = 0, 
a? + Bs? + ys? = 1, O%y%q + BiB, + Y1¥2 = 9, 
m% Pr Ya 
A=l/a, Be yei=l1 
a Bs Ys 


were proved. A three-rowed determinant A whose elements satisfy these 
relations is said to be orthogonal. 

Prove (a) that to any orthogonal determinant A equal to +1 there 
correspond two co-ordinate systems Ox, Oy, Oz and Ox’, Oy’, Oz’ with the 
same orientation, such that the cosines of the angles between the various 
co-ordinate axes are given by the elements of A. 

(6) That for any orthogonal determinant the relations 


4? + a9 + a? = 1, Br¥1 + Bev + Bsvs = 0 
Bi? + Bo? + Bs? = 1, Y1% + Y2%2 + Ya%s = 0 
ytyitt+yt=1 O81 + 2B. + as83 = 0 


are also satisfied. 


6*. Let Ox, Oy, Oz and Oz’, Oy’, Oz’ be two co-ordinate systems as in 
Ex. 5. Assume that Oz and Oz do not coincide; let the angle zOz’ be 0 
(O<6< x). Draw the half-line Oz, at right angles to both Oz and Oz’, 
and such that the system Oz,, Oz, Oz’ has the same orientation as Oz, 
Oy, Oz. Then Oz, is the line of intersection of the planes Oxy. and Ozx’y’. 
Let the. angle zOz, be 9 and the angle x,Oz’ be and let them be measured 
in the usual positive sense in their respective planes, Ozy and Oz’y’. 
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Prove that the passage from Ox, Oy, Oz to Ox’, Oy’, Oz’ is given by the 
scheme 


a’ y' 2! 
x cos cosy — cose sing sin 9 sin 
— sing sin) cos6 — sing cost) cos6 
y sin cos) — sing sind — cos sin® 
+ cos sin} cos6 -+ cos @ cos tp cos 
z sin) sin® cos) sin® cos 0. 


(Noté that this result holds also for 6 = 0 or x, when 9 and become 
indeterminate with 9+ ~=Z202' or 9— Y= Z2xOx' respectively. 
The angles 9, ), 6 are the so-called Eulerian angles, and our result, together 
with Example 5, shows that the most general orthogonal determinant A 
of value +1 may be expressed “ parametrically ” by means of the three 
variables 9, }, 0, subject to the inequalities 


Os0Sn, OS9 < 22, Osd < 2x.) 
7. Let ABC be a spherical triangle of sides a, b, ¢ and angles A, B, C 


on the “ unit sphere ” (i.e. the sphere of radius unity). From Ex. 6 deduce 
the “cosine theorem ” ; 


cosa = cosb cose + sin’ sine cos A, 


8. Find ‘the angle » between the plane 
Az + By + Cz+ D=0 
and the line ; 
w= tot at, Y= Hot Bt, z= %+ yee 
9. Solve the equations 
22—B8y+42 =4 
4x — 9y +162 = 10 
Sa — 2ZTy + 642 = 34. 
10. Prove the identity 
(a? + b9)(c8 + d) = (ac + bd)? + (be — ad)? 
by forming the product of the determinants 
a b ec d 
—b a —d ec| 
11*, Prove that the value of the determinant 


cos(8-++ «)  cos(8@+ 8) cos(8 + x) 
sin(O+a) sin(0+ 6) sin(®+ -y) 
sin(@ — y) sin(y—a) sin(a — B) 


and 


D= 


is independent of 0. 
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12. Tf A= 2+ y+ 2, B= 2y+ yz+ 2a, show that 


BAB 
D=|B B Aj = (22+ ¥8 + 2 — 3ayz). 
A BB 
13. Show that 
i+2 ate a+-2 at+z2 
eo b+e +2 a+s2x a+«zx 


b+a2 b+2 ts + 2% a+a 
b+ae b+2 b+-2 ita 
is of the form A + Bz, where A and B are independent of 2 Hence by 
giving particular values to z, prove that 
— F0) — 2f (0) p_ SO) — fa) 
a—b oer ie 
where 
f) = (4 — H(t, — Ota — H(t, — b). 
14*. Prove that if u and v are functions of x and v= 1/u, then v'’”=D/u‘, 
where D is the determinant 


15. (a) Show that a function u of the form u(x, y) = f(x) g{y) satisfies 
the partial differential equation 


Wy — Ugly = 0. 
(5)* Prove the converse statement. 
16. Prove that 
t t—?r 
w(e, ye) = FEED MAD ary yt a) 
satisfies the equation 
Au = Uy. 
17. Show that a function wu satisfies the equation 
Uggllyy — Uy = 0 
if its first derivatives satisfy a relation of the form 
F(u,, Uy) = 0. 
18*. Prove that a surface u = f(x, y) generated by straight lines 
meeting the w-axis or, what comes to the same thing, a surface cut in 


straight lines by vertical planes ¥ — 6, satisfies the equation 
x 


Pipe + 22y Ugly + YPUyy = 
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19. Find a 8 =: &(c, 2, y) for the continuous functions (cf. pp. 44-5) 


(a) f(z, y) =V(L + 27 + 2y’) 
(b) f(x, y) =V(1 + e*"). 


20. Show that the functions 


Eons ete tt 
fan= ay yy 2 = syne 


tend to zero if (2, y) approaches the origin along any straight line, but that 
f and g are discontinuous at the origin. 

21. Let C be a smooth curve with a continuously turning tangent. 
Let d denote the shortest distance between two points on the curve and / 
the length of arc between the two points. Prove that d — l= o(d) when 
d is small. 

22. Evaluate 


s=5 = (a+b a 
e=0 b=0 a! bt = aty? 


#¢igtes a>0, y>0. 
a sy 


23. Show by using Euler’s relation (p. 109) that a homogeneous 
function S,,(x, y, 2) of degree n which satisfies Laplace’s equation AS, = 0 
also satisfies the relation 

A(r?™S,,) = 2m(2n + 2m 4- 1)r°™3S,, 
where 


faoett yt a 


24. Prove that the curvature of the curve x = x(é) (¢ being an arbi- 

trary parameter) is given by 
pa 4 ae 
(x’?)3 

25*, Let a twisted curve C be defined by x= 2(8), y= y(8), 2 = a8, 
g being the length of arc of the plane curve x = 2(8), y = (8). Prove that 
the osculating plane of the curve at a point P (cf. Ex. 1, p. 93) contains 
the normal to the cylinder x = 2(s), y = y(s) at P. Show that the curva- 
ture and torsion of C are respectively given by 
ay” = aly ape a(x’ a se ay’) 
l+a@ 
(A curve of this kind is called a circular heliz.) 


26. Find the equation of the osculating plane (cf. Ex. 1, p. 93) at 
the point © of the curve z= cos0, y = sin8, z==f(8). Show that if 


6) = i cosh 48, each osculating plane touches a sphere whose centre 
A ig Pp 


is the origin and whose radius is V(1 + 1/A?). 
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27. A curve is drawn on the cylinder 2% + y? = a*, such that the 
angle between the z-axis and the tangent at any point P of the curve is 
equal to the angle between the y-axis and the tangent plane at P to the 
cylinder. Prove that the co-ordinates of any point P of the curve can be 
expressed in terms of a parameter 6 by the equations 


z=acos0, y=asin®i, z=c+alog sing, 
and that the curvature of the curve is (1/a) sin®@(1 -+ sin?6)t. 
28. (a) Prove that the equation of the plane passing through the 
three points ¢,, t,, tg on the curve 
a= tal, y= bf, z= ct 


is 
32 
a 


— At + t+ t)¢ + (tats + tet + tit) = — ttt, = 0. 


(6) Show that the point of intersection of the osculating planes at 
ty, tg, ty lies in this plane. 


29. Let a, b, c, A, B, C, be the sides and angles of a triangle of area 
8, and let R be the radius of its circumscribed circle. Show that 
ds = R(cosA da + cosB db + cosC de). 


30. Consider a fixed point A in space and a variable point P whose 
motion is given as a function of the time. Denoting by P the velocity 
vector of P and by @ a unit vector in the direction from P to A, show that 


d ; 
q (AP) = —aP. 


31. Let A, B, C be three fixed points and let the components of the 


velocity vector P of a moving point P in the directions PA, PB, PC be 
u, v, w. Let a, 6, c be unit vectors in the directions PA, PB, PO. Prove 
that 


ww Gis om 
PA PA 


da_ 4 (mare, a eatte) od. 
PA PA 


32. Prove that the acceleration vector P of the point P is 


P = ua + Bb + yo, 
where 
em et ay cosAPB 1 ) + wo (“a ~ pm) 
PA PB PA PC?’ 
with two similar expressions for 8 and y. 


33. Find the envelope of a variable circle in a plane which passes 
through a fixed point O, and whose centre describes a given conio with 
centre O. 
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34. If I is a plane curve and O a point in its plane, the locus I” of 
the orthogonal projections of O on a variable tangent of IT’ is called the 
pedal curve of I’ with respect to the point O. Prove that if the point M 
describes the curve I’, the pedal curve I” is the envelope of the variable 
circle with the radius vector OM as diameter. 


35. What is the envelope of the variable sphere with the radius vector 
OM (cf. Ex. 34) as diameter? 


36. What are the envelopes of the variable circles and spheres of Ex. 34, 
35, if T is a circle and O a point on its circumference? 


37. MM’ is a variable chord of an ellipse parallel to the minor axis. 
Find the envelope of the variable circle with MM’ as diameter. 


38. A plane moves so as to touch the parabolas 
2= 0, y? = 4a and y= 0, 27 = 4m, 
Show that its envelope consists of two parabolic cylinders. 


39.+ Generalize the investigation of § 1 of the Appendix to Chap. II 
(p. 204) to functions of » variables, proving the following results. Let 
flay «++» %,) be three times continuously differentiable in the neighbour- 
hood of a stationary point 2%, = 2,°,..., %, = %,°, that is, a point where 
Se, = fa, = +++ = Fn, = Consider the second total differential of f 
n 
at the point 2°, df°= 2D fx, da3 this is a quadratic form in tho 
kel 
variables dx,,...,4%,. If this quadratic form is non-degenerate, that is, if 


Fie, > ++ Saye, 
D=| : : [+9 
Sogn +++ Sent 
then d?f° may be (1) positively definite, (2) negatively definite or (3) indefinite. 
Prove that these possible cases correspond respectively to the following 
properties of f at the point (2°): (1) f has a minimum, (2) f has a maximum, 
(3) f bas neither a minimum nor a maximum. 


40. Consider the function of two variables f= (y — 2*)(y — 22%), 
which is stationary at the origin O (x = y = 0). Prove (1) that along any 
straight line through O, f has a minimum at O, (2) that f, considered as a 
function of (x, y), has neither a minimum nor a maximum at O. 


41. Let P,P,P, be a plane triangle with all three angles less than 
120°. Prove by the criterion of Ex. 39 that at the point P interior to P,P,Ps 
such that 1 P,PP,= 1 P,PP,= ZP,PP,=120°, the sum PP,+ PP,+ PP 
is actually a minimum (cf. Ex. 4, p. 187). 

42. Where does the minimum of the sum PP, + PP, + PP, ocour 
if in the triangle of Ex. 41 the angle P,P,P, is greater than or equal to 120°? 


+ For Ex. 39, 41, 43, and 44, the reader is assumed to be familiar with the 
elements of the theory of quadratic forms. 
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43. To investigate stationary points of f= f(2,,...,2,), where the 
variables satisfy the relations 


912, - + > Bq) = 0,006 Pm(T, +29 %_) = 0 (mn), . (1) 


we may assume that we have found numerical values for the variables and 
the multipliers ,, such that F = f+ 49,-+...+ AmPm satisfies the 
equations 


OF ex, = 0,..., OF /Ot,=0, » . . « . (2) 


and such that the Jacobian of 9,,...,9,, with respect to the variables 
Xy,.. +, Ly is not zero. To apply the criterion of Ex. 39 we may proceed 
as follows. Regarding 2,41, .-., % a8 independent variables, by differen- 
tiating (1) we can obtain the first and second differentials of 2... , %m 
as functions of 7,+43,...+,%, and finally introduce these values into 


of= 2% Fig ts An + fo, Pry + eet fay Pe, + + (3) 


Prove the following second rule, not involving the computation of 
the second differentials d®a,,...,d°x,,. Regarding 2,..., %, a8 inde- 
pendent variables, consider 


CF = LP p59, 1; Ay = f+ Ao, tee et Ain B Ooms 


compute da, - -, 4x,, from the equations 
do, = Pye, I + coat Pus, Wn = 0 (u=1,...,m) 


and introduce these values into d?/’, thus obtaining a, quadratic form 38°F 
in the variables dx,,41,...,dx,. If this quadratic form is non-degenerate, 
then f has respectively a minimum, a maximum, or neither of these, ac- 
cording as 85°F is positively definite, negatively definite, or indefinite. 


44. In the problem of finding the maximum of f = a,2,...2, sub- 
ject to the condition 9 = 4+ a,+...+2%,—a=0 (a> 0), the rule 
of undetermined multipliers gives a stationary value of f at the point 
t= %=...=%,=a/n. Apply the rule of Ex. 43, instead of the 
consideration of the absolute maximum, to show that f has a maximum 
value at this point. 


45. Apply the criterion of Ex. 43 to prove that among all triangles of 
constant perimeter the equilateral triangle has the largest area (cf. Ex. 2, 
p. 200). 


46. The curve x + y8 — 3azy = 0 has a double point at the origin. 
What are its tangents there? 


47. Draw a graph of the curve (y — 2*)? — 25 = 0, and show that it 
has a cusp at the origin. What is the peculiarity of this cusp as compared 
with the cusp of the curve 2? — y? = 0? 
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48. (a) Prove that if all the symbols denote positive quantities the 
stationary value of iz + my + nz subject to the condition z” + y? + z 
= 0? is 

(12 + m2 + nite, 
where g = p/(p — 1). 

(5) Show that the value is a maximum or minimum according as 
p2l. 

49. Find the values of z, y which make 


228 + (2 — y)? — by 
stationary. 


50. Prove that if & is a closed convex curve and ABC is the circum- 
scribed triangle of least area, then the points of contact of & with the 
sides of the triangle are the centres of the sides. 


51. Show that each of the curves 
(x cosa — ysina — 6)? = c{z sina + y cosa)’, 
where « is variable, has a cusp, and that all the cusps lie on a circle. 


52. If C= f(a, 6) is a true maximum or minimum of f(z, y) subject 
to the condition 9(z, y) = C’, show that in general C’ = 9(a, b) is a true 
maximum or minimum of 9(z, y) subject to the condition f(z, y) = C. 

53. A circle of radius a rolls on a fixed straight line, carrying a tan- 
gent fixed relatively to the circle. Taking axes at the point of contact 
where the moving tangent coincides with the fixed line, show that the 
envelope of the tangent is given by 

az = a(@ + cos6 sin 6 — sin6) 
y = a(cos*6@ — cos). 

54. If the co-ordinates (x, y, z) of a point on a sphere are given by 

the equations (cf. p. 160) 
z=asinOcose, y=asinOsing, z= acosd 

show that the two curves of the systems 0+ p=«, 0—o= 8, 
which pass through any point (8, @) cut one another at the angle 
arc cos {(1 — sin?@)/(1 + sin®6)} (cf. p. 164). 

Show that the radius of curvature of either curve is equal to 

a(1 + sin?6)?/(5 + 3 sin®6)t 

(cf. Ex. 24), 

55. If 


{2 
f(z) =f log (1 — 2? cos?0)d0, 
0 
prove that f(x) is finite if 2? <1, and that if 22<1 
d 7/2 @ ; 
fe) = if F, {low (1 — 2 cos*6)} dd, 


and hence evaluate the integral. 
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56. Show that the area = of the right conoid 
w==rcos6, y=rsinO, z= f(6), 


included between two planes through the axis of z and the cylinder with 
generating lines parallel to this axis and cross-section r = f/(6), and the 
area of its orthogonal projection on z = 0 are in the ratio 


[V2 + log(1 + 0/2)}: 1. 


57. Assuming that the earth is a sphere of radius R for which the 
density at a distance r from the centre is of the form 


and the density at the surface is 2} times the density of water, while the 
mean density is 54 times that of water, show that the attraction at an 
internal point is equal to 


where g is the value of gravity at the surface. 


58. Let (21, 41), (%2» Ye), (%3 Ya) be the vertices of a triangle of area 
A (the order of the suffixes giving the positive orientation). Prove that 
the moment of inertia of the triangle with respect to the x-axis is given by 


A 
A (ys? + ye? + ys + yite + Yes + Ysi/1)- 


59. A hemisphere of radius @ and of uniform density ¢ is placed with 
its centre at the origin, so as to lie entirely on the positive side of the 
ay-plane. Show that its potential at the point (0, 0, z) is 


6 [(at + 21 — at + 8 at — Enptit0<s <a 

and 

Pre | (at + 2 +a 5 ate = @ meet ite >a, 
z 
60. Sketch the curve 

Be endl oat 
1+” 1+? 
and calculate the area included by the curve. 


-1S,S41 


61. Prove that the attraction at either pole of a uniform spheroid 
with density p and semi-axes a, a, c is equal to 
2e 
anp [ r (1 — cos6) dr, 
0 


where 
r == 2a%c cos 0/(a? cos?6 + c* sin? 6), 
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62. In the integral 
(20— 4x) /(8—x) 
t= fea 2 f ty — 4)/dy 
Afee 
change the order of integration and evaluate the integral. 


63. (a) By transforming to polar co-ordinates, show that the value 
of the integral 


asing va—¥ TT 
k= [ (fe lgtet + sae} (0<8 <3) 


is a®B (loga — 4). 
(b) Change the order of. integration in the original integral. 
64. Find the volume V cut off from the right cone 
et y= (h—z) tanta, O52 Sh, 
by the right cylinder whose base is the curve 
(h tanz — r)® = A® tan? « sin’6 cos? 6, 
where z, 7, 0 are cylindrical polar co-ordinates, the volume nore outside 
the cylinder and inside the cone. 


65. Show that for the hyperbolic paraboloid z = zy the value of 
= 1=f eerss | yy — Many ag 
(I+ 28 + 2)! 
taken over the surface bounded by the generators through the origin and 
the point (&, », ¢) is 


—are tan [E22 /(n? @ + C26? + &%4?)#), 
66. Prove that for —1 <a <1 and —3< are sina < 5 


Ka) =f log(1 + @ 0082) 2, = are sina. 
0 


cosz% 


67. Show that the area in the positive quadrant bounded by the 
curves 2° = a®’y, 2 = b’y, y? = ca, y® = dx is 


fs (at — Bt)(ot — dt). 


68*. Let I’ be a closed curve in space on which a definite sense of 
description of the curve has been assigned. Prove that there is a vector a 
with the following characteristic property: for any unit vector », the 
scalar product @z is equal to the algebraic value of the area enclosed by 
the orthogonal projection of I’ on the plane II orthogonal to #. (Note 
that 2 gives the orientation of II, and I’ gives the orientation of its 
projection on IJ.) In particular, the projection of I’ on any plane parallel 
to a has the algebraic area zero. (The vector a may be called the area 
vector of T.) 
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69. Prove that in a central orbit the attraction p per unit mass is 
given by 
h? dq 
p= @ a 
where g is the distance of the tangent of the orbit from the pole and h the 
area constant (p. 425). 


Hence prove that the cardioid r == a(1 +- cos) can be described under 
an attraction to the pole equal to yr‘ per unit mass. 


70*. Let there be m fixed particles in a plane, all attracting with a 


central force of magnitude . Prove that there are not more than n — 1 
r 


positions of equilibrium for a particle in the field. 
Calculate these positions for the case of four attracting particles with 
co-ordinates (a, b), (—a, b), (a, —b), (—a, —b), where a > b > 0. 


71*. A particle of unit mass moves under the action of two forces, 
of which the first is always towards the origin, and is equal to 22 times 
the distance of the particle from that point, while the second is always 
at right angles to the path of the particle, and is equal to 2y times its 
velocity. Prove that if the particle is projected from the origin along the 
axis of x with velocity u, its co-ordinates at any subsequent time é are 


= TORE sin V(22 + y2)t cos pt, 
y= Tae sin V(2? + p2)t sinus. 


72. (a) If u, v are two independent solutions of the equation 
Flay” — fay” + ofz)y’ + Maly = 0, 
prove that the complete solution is Au --- Bu + Cw, where 
ies uf uf(x\da vf ehere 
(uv’ — u’v)? (uv’ — u’v)? 
and A, B, C are arbitrary constants, 
(b) Solve the equation 
x(a + 5)y’” — a(Ta® + 25)y”” + (2207 + 40)y’ — 30ry = 0, 
which has solutions of the form 2”, 
73. The tangent at a point P of a curve cuts the axis of y at a point 


T below the origin O and the curve is such that OP = n.OT. Prove that 
its polar equation is of the form 


_ (1+ sing)" 
ee coatttg 
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74, Determine the solutions of the equation 
ez 2 O% 
= gq 


at a8 


ayy 

Ot Ox 

75. Prove that if K is a homogeneous function of x, y, z the equation 
~(xe ris Ke) + 2(xe =0 
Ou Ou, oy oy ez 4 

has a solution which is a power of (x* + y?+ 2’). 


which are also solutions of 


76. (a) Apply Cauchy’s theorem to the integral 
f[(e+iy me (n > m > 0) 
2 


taken along a path consisting of the positive quadrant of the unit circle 
|z| = 1 and the parts of the axes between the origin and this circle, a 
small circular detour being made round z = 0; and hence deduce that 

_ (m1 m)r 

in 


> rom + yr(25*) 
oa r("t" 41) 


(b) Prove that if n = m the value of the latter integral is 7/2™+1, 
(In the complex integral the integrand may be taken as real on the 
positive half of the axis). 


77. Prove that if f is analytic = f(V x) is equal to the result ob- 


ar [2 
f cos™ @ cos n6 d6 = 
0 


tained by putting y and @ each equal to va in the expression for 
g” fy). 
oy" (y + a)rtt 
78. Show that if x and y are real 
| sinh (x + iy) | = A(x), 
where A(z) is independent of y and tends to © asz—> +. 


By integrating ieee round a suitable sequence of contours, 
show that (z — w) sinhz . 
1 1 2 (—1)" 
dinhw "wt hay nit 


SUMMARY OF IMPORTANT THEOREMS 
AND FORMULA 


. Differentiation. 

Convergence of Double Sequences. 

Uniform Convergence and Interchange of Infinite Operations. 
. Special Definite Integrals. 

Mean Value Theorems. 

Vectors. 

- Multiple Integrals, 

. Integral Theorems of Gauss, Green, and Stokes. 
. Maxima and Minima. 

. Curves and Surfaces. 

. Length of Arc, Area, Volume. 

. Calculus of Variations, 

. Analytic Functions. 
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1. DIFFERENTIATION 


Chain Rule for Functions of Several Variables. 
If u= f(é, n, Coe mals where = E(x, y), = 7(2, y),. ees 
Ug = fibe + fine + fee fees 
Uwe = feeb? + Lana? + feels? + +++ 
+ 2fenEaNe + ecEale a atate 
+ fEen + fine + feSae + ees 
with corresponding formule for u,y and u,, (p. 73). 


Implicit Functions. Tf F(x, y) = 0, 


dy __*. 
de *¥,. 
2 : 
oy —- FF, Ber Fots + F,, F 2 (p. 115-16). 
v 
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saa If €= d(z, y), n = de, 9), 
_ ty Om __ by 8 _ Pe %_ do 


— £2 “= Lid “=, 


ea ~~ 7’ on D’ d€ D’ % OD 
where 


_ 1, 7) n) _ be dy] sd 
Bar ah lle. ae = pathy — dria 


(Jacobian or functional determinant) (p. 143). 
Rules for Jacobians. 


O(a, ¥) 1 

1 = . 144), 

_ En) OE a) ey 
O(a, y) 


(2) If w= u(€, 7), v= v(, 7) and £= g(a, y), 9= le Y): 
then 
A(t, v) O(u, v) O(€, 9) 147 
aa, y) a, 2) Oe, y) ee 


2. CoNVERGENCE oF DouUBLE SEQUENCES 


Convergence Test for Double Sequences (pp. 102-3). The 
SEQUENCE Gym ConVerges, or, in symbols, 


if, and only if, for every positive ¢ there is an N such that 
| Gum — Gam |< € 


when » > N, m>N, n' > N, ae ens Then 


lim dam = lim lim nm) = => lim Gum)» 
n—> 0 n—>o \m—->o am n—> © 
m—> 2 
provided that lim ay, and Le L nm respectively exist. 
m—>o 


3. Untrorm CoNVERGENCE AND INTERCHANGE 
or INFINITE OPERATIONS 


Dini’s Theorem. If a series of positive continuous functions 
converges to a continuous limit function in a closed region, it 
converges uniformly to that limit (p. 106). 


602 SUMMARY OF FORMULZ 


Interchange of Differentiation and Integration. (Differentiation 
of an integral with respect to a parameter.) 


d b 
4 [te nay= [fle vay 

provided that f(x, y) and f,(x, y) are continuous in the interval 

under consideration (p. 218). 


Interchange of Differentiation and Integration in Improper 
Integrals. 


ace (2, y)dy =} “faler 9) dy, 


provided that f,(z, y) is continuous in the interval under con- 
sideration and the integrals f F(a, y)dy and f F.Ax, y)dy converge 
0 0 


uniformly in that interval (p. 312). 
Interchange of Two Integrations. If f(«, y) is continuous and 
a, 6, a, B are constants, 


f ‘de f le, ydy = A “dy f "fa, g)de — (p. 289). 


The order of integration may also be reversed when the limits 
are not constants, provided that both integrations are performed 
over the whole of the region concerned and corresponding new 
limits are introduced (p. 242). 


Interchange of Two Integrations in Improper Integrals. 
Bn o AB 
d  ydy=] d yada, 
[ef tend =f ay f fey) 


provided that the integral if J (x, y)dy converges uniformly in the 
0 
interval a < #S B (p. 310). 


4. Specran Derinire INTEGRALS 


Nh "ede = 44/n (pp. 262, 561; see also Vol. I, p. 496). 
0 


f peg 37 (pp. 315, 554; see also Vol. I, pp. 251-3, 
oS 418, 450). 
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Fresnel’s Integrals: 
(oo) +a 
f sin (72) dr = f cos(72) dr = Ke (p. 317). 


Fourier’s Integral Theorem: Tf f(x) is sectionally smooth and 
oer) 
i | f(a) | dx converges and if f(z-+ 0)-+ f(e— 0) = 2f(2), then 


1 id ; 
J (x) aero f é g(r)e"* dr, 
where g(t) =s5 f sis f(je—* dt (p. 319). 


The Gamma Function (pp. 325-38). If 2> 0, the gamma 
function I(x) is defined by the equation 


T(a) = f “ete dt = 2 f “eh ee dt, 


It satisfies the functional equation 
T(x + 1) = aI(a); 
hence if z is a positive integer n, 
T(n) = (n — 1)! 


For all values of z other than 0, —1, —2, .. . it may be expressed 
by the formule 


T(z) = lim Sal) 
no (2+ 1)...(e+n—]) fya1 lt a/v 
1 = ae & ao 
l@ H(1+%)e > 


where y = lim ( = — logn is Euler’s constart. Further, for 
n—>o \v=1V 
every integer m = 2, 
2 4 (—1)" dm 


ety (n—ii de log hey 
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Again, 
Ta) — 2) = 
sin 7a 


(“ extension theorem”). Hence, in particular, 
T(2) = 2] e~"dt= ? 
(3) ix Vn 


The beta function B(x, y) is defined as follows for positive 
values of x and y: 


1 +4 
Be, =f ea —yyde= f +o — neds 
0 -i 
{2 
=2 f sin®*—14 cos/—14 dd. 
0 
The beta and gamma functions are connected by the relation 


— Pry) 
Bia, y) = Te + ¥) (p. 337). 


For any complex z, 
(l— e&*)P(z) = f t*—) e-‘dt, 
where C denotes a path which surrounds the positive real axis 
and approaches it asymptotically on either side (p. 566). 


5. Mean VatuEr THEOREMS 
Mean Value Theorem for Functions of Two Variables (p. 80). 


f(a+h, y +k) —fle, y) = hfe + oh, y + 6h) 
+ if,(a-+ 0h,y + 0k), O<O<1. 


Laylor’s Theorem for Functions of Two Variables (p. 80). 
fathytk)—fe,y)=hfet kf, 
+ 5 (fae + kay + BYfey} 
+... 
+ = {anf, re C hl fin—ay + kf, i 
ni a 1 wry Tees ” 
+ R,, 
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where the remainder R, (in the symbolical notation of p. 79) is 
given by 


1 


~The L1 (n+1) 
Be (n+ I) {hfalot- Oh, y+ Oh) + kf,(a+ Oh, y+ an)} By 


@<1. 


If as m increases this remainder tends to zero, we have the 
infinite Taylor series 


fat h, y+ k) 
= fe, 9) + 1 hfe + bia} + 5 (fen + 2hlefon + her} 


1 n n n—. me 
a eb {ithe + (*)s Pe oe bnfip\ pees 
Mean Value Theorems for Multiple Integrals (p. 232). 
J [fl 948 = vAR, 


where AR is the area of R and p a value intermediate between 
’ the maximum and the minimum of f(z, y) in R. 
Similarly, if p(x, y) = 0, 


J [f@ wpe nas = uf fle, yas. 


6. VECTORS 


For the definition of a vector see p. 3. 
Let v be a vector in three dimensions with the components 
Uz, Vay Ug. 


Length of a Vector. 
Jo] = V(o? + 0? + 03°). 
Addition of Vectors. 
z=u-+v 
means the vector which has the components 


My =U ty, m= et %, % = Us t % (p. 5). 
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Scalar Product (inner product) 
uv =|u||v| cosd 
= Uy0y TP Ug, FP Uggs 
where 8 is the angle between z and v (p. 7). 
Vector Product (outer product) 
2= [uv] 


means the vector which has the components 


— |“2 Us - |“ % —|4 17). 
Ole: ae je alt Bly (p. 17) 
Differentiation. 
dUu+v)__ du, do, 
a) dt st” 
d(uv) du dv. 
ae eae acer) 


dle‘e) _ [= o| + [ (p. 85). 


If the co-ordinate axes are rotated, the vector components 
are transformed in the same way as «, y, 2, the components of 
the position vector (p. 84). 

By the derivative of the function f(x, y) i the direction of the 
unit vector 2 whose components are cosa, sina, we mean the limit 


tim f+ pcosa, y+ psina)— f(y) _ ala y) 


e—>o P on 
Hence 
) Cl) ts oO 
a cone =. + sina ay 
In particular, 
a cosa, Oo sin a, 
on on : 
and hence in general 
fe Ea 
On dx0n dy On 
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In the same way, in three dimensions the derivative in the 
direction of the vector # whose components are cosa, cosf, 
cosy is 


of _ of of of 
aos cosa =~ + cos B a cosy 5 
_ of a , af ey dz 
Fein | by an | Oe Oe ee 
The Differential Operations. 


With every scalar function f(%, 22, %3) there is associated a 
vector grad f with the components f,, fx, fx, (p. 89). The deri- 
vative of f in the direction of the unit vector 7 is grad f. 


With every vector field z(a,, 2, %3) there is associated a 
vector curl z with the components 


Ou Og Oy Ug Ug OH 
tii, Oy’ Oxy Ory Bn, day = 
and a scalar function 


div «= =e pe aot se (p. 91). 


Using the symbolic vector V oe with “ components ” 


2. cs Le we have 
On,” Oa,’ O25 


grad f = Vf, curl « = [Vz], diva =Vu (p. 92). 
Further, 
curl grad f= 0, div curl z= 0, 


div grad f= Af= mit mits s 


7. MuitreLe INTEGRALS 


For the definition of a multiple integral see p. 224. 

The rules for the addition of integrands and combination of 
regions of integration are the same as for ordinary integrals 
(p. 231). 
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Transformation of a Multiple Integral. If the oriented region 
R of the zy-plane is mapped on a correspondingly-oriented region 
R’ of the w-plane by means of a reversible one-to-one trans- 
formation whose Jacobian 
pay) 
a(u, v) 
does not vanish anywhere, then 


f f fla, y)dady = f f f(x, y)Ddudw (pp. 253, 377). 


An analogous formula holds for any number of dimensions (p. 254). 
In particular, transformation to polar co-ordinates 


z=rcos#, y=rsind 
or 
g=rcospsind, y=rsindsind, 2=rcosd 


gives the formule 


f f f(a, y)dady = f f f(r 0088, r sin 6)rdr dd, 
(Vol. I, p. 494). 


Lf [fey adzdyde =f f ff(e, y,2\r* sinBdrdads (p. 254). 


Reduction of a Multiple Integral to Ordinary Integrals (p. 243). 
Leta Sy S Pin R, and for every y let a = aly) Sa S dy) = 6; 
then 


Jf fenaedy= [ay fee, ae. 


8. InrecraL Turorems or Gauss, GREEN, AND STOKES 
For the definition of a curvilinear integral (line integral), see 
pp. 344 € seq. 
1. Two Dimensions. 
If the region FR is simply connected, the line integral 


i (ada + bdy) = f Adz 


is independent of the path C joining two points in R if, and only 
if, the condition of integrability 
ay = b, 
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holds at every point of R. In this case, if the initial point is fixed, 
the integral is a function U(£, 7) of the end-point, such that the 
vector A with components a, b satisfies the relation 


A = gradU (p. 352). 


Gauss’s Theorem. Let R be a simply-connected region and C 
its boundary. Then 


f [ {ey + ale, } dedy = fF, ydy — ga, y) da}, 
? +0 (p. 360) 
or, in vector notation, 


f [div Adudy = [ Ands = [ Ands, (p. 364) 
R 


where 2 is the unit vector in the direction of the outward-drawn 
normal, A, the normal component of the vector A with com- 
ponents f, g, and ds the element of arc of the boundary curve. 


Green’s Theorem (p. 366). 
J f(eeat u,v,)dady =— f fubodndy+ {(—woyde+ uv, dy) 


mf [shaded | oe as, 


ff wdo—vdu)dady= f {(vu, — uv,) de — (vig — uv,) dy} 


Tn vector notation the first form of the theorem is 
j Ou 
J {(gradu gradv)dady = —f fo div grad udady az =, ss 


where 
Au = div gradu = tye + Uy; 


and 0/dn denotes differentiation in the direction of the outward- 
drawn normal. 
24 (2912) 
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2. Three Dimensions. 
The necessary and sufficient condition that the line integral 


f (adz + bdy + edz) = [Adx 


shall be independent of the path C joining two points in a simply- | 
connected region R is | 
curl A = 0, 
or, in full, 
y= bz, b,= Cy, Cy= a, (p. 358). 
Surface Integral (p. 381). This is given by 


f f {a(x, y, z)dydz + b(a, y, z)deda + e(a, y, z)dady} 


a 2) o(z, x) 0(z, ¥)\ 4,4 
“Tf fe ax, Y, 2) 5 rar a Ys 2) 5 Aca, ae O(a, Y, 2) 5 (as, 2 dud, 


if x= a(u, v), y= y(u, v), 2= 2(u,v) and the oriented region 
B in the wv-plane corresponds to the surface 8S. 

Gauss’s Theorem. Let 2 be the unit vector in the direction of 
the outward-drawn normal and A, the normal component of the 
vector A with components a, 6, c, further, let o/on denote 
differentiation in the direction of the outward-drawn normal. 
Then 


= ox oy Oz 
[ff fleet b+ cdedyds=f f(a + 0% + 0%) as, 
(p. 386) 
or, in vector notation, 


Jf [div Adadyde =f [Ands =f [4,d8, (p. 388) 


the integrals on the right being taken over the closed surface S 
bounding the region R. 
Green’s Theorem (p. 390). 


i i: B (Ug0g + UyVy + U,0,) da dy dz 


=—f f fudodedyde +f fus, © as, 
Jf [(ude—vdu)dedyde =f f (uz av 02) as, 
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where 0/dn and S have the same meanings as before and 
Au = Use + Uyy + Usa 


Stokes’s Theorem (p. 393). Let the oriented surface S be 
bounded by the correspondingly-oriented curve C. Then 


SG aves B-B)eeet (E- 3) e4| 
=| (oda + dy + xd). 


In vector notation: let A, be the tangential component of the 
vector A= (¢, %, x) in the direction in which the curve C 
is described, (curl A), the component of curl A in the direction 
of the outward-drawn normal, and ds the element of arc on C 
measured in the direction in which the curve is described: then 


J {cat A),as = [ Ards. 


9. Maxtma anD Minima 


The following rules hold only for maxima and minima in the 
interior of the region under consideration. 


Free Maxima and Minima of a Function of Two Variables. 
The necessary conditions for an extreme value of the function 
w= f(z, y) are 
fza=9, fy=0 (p. 184). 


If these conditions are satisfied and if 
Tewfuy — fry > 0, 


there is an extreme value at the point in question. It is a maxi- 
mum or a minimum according as f,. (and hence also f,,,) is negative 


or positive. If 
Soofyy — fi <9, 
the point is a saddle point (p. 207). 


Maxima and Minima subject to Subsidiary Conditions (Method 
of Undetermined Multipliers) (pp. 188-99). 
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If in the function v= f(a,,..., %,) the n variables are con- 
nected by the m subsidiary conditions (m < n) 


Pilayy oo +5 La) = 0, 20> Pm(By - ++» Vn) = 0, 
we introduce m multipliers A,,..., Am and form the function 
F=f+ Adit Agate. - + Am Pm 
Then the m conditions and the ” additional equations 
5a =0,..., Zao 


give (m-- n) necessary conditions for the extreme points, 


10. Curves * anp SuRFACES 
In what follows (¢, 7), or (, 7, ©), are current co-ordinates. 
1. Plane Curves. 
Equation of the curve: 
@) y=f@), ©) F@y=0, (z= 40, y= $0. 


Equation of the tangent at the point (x, y) (Vol. I, p. 263; 
Vol. II, p. 122): 


(a)n—y= (&—2)f'@), (0) (E—2)F.+ (n— WF, = 0, 
(c) {E— O}¥'O — {n — ¥O}P'O = 0. 


Equation of the normal at the point (a, y) (Vol. I, p. 263; 
Vol. IT, p. 123): 


(a) E—2+ (n—y)f'(z)=0, (6) (€—2)F,— (9-9) Fe= 0, 
(c) {E — PO}$') + {n — YO} YO = 0. 
Curvature (Vol. I, p. 281; Vol. II, p. 125): 


== y” Pg FP Way PoP y + Pw PoP 
Oa ap OF a 7 
dh — be 
k= 5. 
OE a 


* Some formule discussed in Vol. I have been repeated here for convenience 
of reference. 
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Radius of curvature (Vol. I, p. 282; Vol. IT, p. 126): 


etl 
[FI 


Evolute (locus of centre of curvature) (Vol. I, pp. 283, 307-311): 


rl 1} 1 vy 
(2) €=a-—y zm q=yt re, 
y y 
F2+F, 
OE oe Oey , 
0) f= a+ Fea Fa OFF, F, + FAFA 
F2+F2 


I= ot fe FS OFF, + PPP 
gy BER poy gg PEE 
Involute (Vol. I, p. 309): 
E=a2-+ (a—s\i, n= y+ (a— 5), 


where @ is an arbitrary constant and s the length of arc measured 
from a given point (s being the parameter). 


Point of inflection (Vol. I, pp. 159, 266; Vol. IT, p. 125): 
The necessary condition for a point of inflection is 


(2) y" = 0, () Fyek } — 2F FF, + Fy, Ff? = 0, 


(0) éy — aj = 0. 
Angle between two curves (Vol. I, p. 264; Vol. IT, p. 126): 
F.G,+ FyG 
b) cosw = =—2 eid 5 
©) 0080 = TSE FAW G+ GP) 
bay Fs 
(c) cosw = cee LL Ee 
VE + PV? + 91”) 


In particular, the curves are orthogonal if 
(6) FaGe+ FyG,=0, (0) tt, + oy = 0; 
the curves touch if 
(0) FeGy — FyGe=0, (¢) ayy — hy = 0. 
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Two curves y= f(x), y= g(x) have contact of order n at a 


point 2, if 


S(®) = g(a), f'(2) = 9'(@), «++, F@) = 92), 
fon) g(a) 
(Vol. I, pp. 331-3). 


2. Curves in Space. 
Equation of the curve: 
c= $f), y= 4), z= x(t). 
Direction cosines of the tangent (p. 86): 
VEEP +A VOPR AY VO+P+A) 
Curvature (p. 86): 


aim =/ (Ga) + (Gt) + (Bf 


where ds is the element of arc. 


3. Surfaces. 

Equation of the surface: 
(a) z=f(a,y), (6) F(x, y,2) = 0, 
(c) = g(u, v), y= Pu, v), == x(u, v). 

Equation of the tangent plane (p. 130): 
(a) $—2=(€—2)f.t(n— fy 
(6) (€— 2)F. + (n— y)F, + ($ — 2)F, = 0, 
(c) (€ — ®)\(tuXe = toXu) + (0 — y)(xubo — Xobu) 

(= (dub ~_ Poipu) = 0. 


Direction cosines of the normal (Vol. IT, pp. 130, 163): 


a= ae eae: eee = — ee 
aes Varfeapy 8 — Tas py 
1 ry 
V+ f+ fi) 


cosy = + 
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(b) cosa = eee. eae cosp = a ee 
V(F e+ FY + F) V(F 2+ FP + FY) 
F, ‘ 
ONY VP + PEE FA 
(c) cosa = ae eee cosB = geen 
Via + B+ VEE BTS 
Cc 
oY VF BE OY 
where 


A= duxo— Yoxw B= Xubo— xobw C= duho — bobu 
Angle between two surfaces (Vol. IT, p. 130): 

COSw = COS a, Cosa, + cos, cos, + cosy, Cos y2} 
in particular, the condition that the surfaces are orthogonal is 
cosa, cosa, + cosf, cos, + cosy, cosy, = 0. 
4. Envelopes (Vol. II, pp. 171-83). 


To obtain the envelope of the family of plane curves 
f(z, ye) =0 

or of the family of surfaces 

S(@ Y 2 ©) = 9, 


we calculate the “discriminant” by eliminating c from the 
equations 
f=090, f-=0. 


The discriminant contains the envelope and also the geometrical 
locus of the singular points. 

If the family of curves is given by the parametric equations 
x= (t, c), y = (t, c), the discriminant is obtained by eliminating 
c and ¢ from the equations 


z= f(t,c), y= pt, ¢), se x _ - = = (p. 174). 


The envelope of a two-parameter family of surfaces 


I (a, y, Z, Cy, Co) = 0 
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is contained in the equation obtained by eliminating the two 
parameters ¢,, c, from the equations 


f=9 f,=90, f,,=0. 


11. Lenets or Arc, AREA, VOLUME 


Length of Arc (Vol. I, pp. 276-80). Let a plane curve be given 
by the equations 


(2) y=f(z), (6) F(z, y)=90, (c) c= dt), y= Hf), 
(2) (polar co-ordinates) r = 7(9). 


The length of arc is 
n t 
= f = i Yd dt, 
(a) s [vat yee, () s=[ vieh+ #) 


Oem [PVF I+ Rds,  s=f-yert rab 


The length of arc of the three-dimensional curve 
c=), y= $0), z= x(t) 


4 
s=f Vit + P+ edt (p. 86). 
ty 
Area of Plane Surface. The area bounded by the curve 


r= 1(0) 


and two radii vectores 9, 0,, where r, 8 are polar co-ordinates, 
is given by 


6, 
2 f rd0 (Vol. I, p. 275). 


The area enclosed by the curve 
y¥ =f (2), 


the two ordinates = Zp, x = 2,, and the z-axis, is 


f "yaa (Vol. I, p. 80). 
a 
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Let # be a positively-oriented plane surface and C its boundary 
(for the orientation and sign of an area cf. Chap. V, section 4, 
p. 375). Then the area of the surface is 


(ase [pfx iene 


(pp. 347, 375-6). 


Area of Curved Surface (pp. 268-74). Let the equation of 
the surface be 
(2) z= f(x, y)s () F(z, y, 2) = 0, 
(c) c= (u, v), y= Plu, v), z= x(u, 9). 
In case (c) let Z, F, G be the so-called fundamental quantities 
of the surface, i.e. let 
E= os + p? + Xe 
F= dubo + Huby + XuXov> 


G= b2+ p+ xP (pp. 162, 273). 
Then 
EG— F? = (dutho— Puthu)? + (fuxXe— Poxu)?+ (XuPo— Xvbu)* 
(p. 273). 


The length of arc of the curve 
u= u(t), v= v(t) 
drawn on the surface is then 


= fi "(BaP + QF is + G02) dt (p. 162). 
ts 


The area of the curved surface lying vertically above the 
region & in the «y-plane is 


A= f f do: 
(a) 4=f [VA+f2+fi)dedy, 
0) 4=f f 5 vba + FP + Faded, 
@A=/ cs WEG — F*) dude, 
the last integral being taken over the region B of the uv-plane 


which corresponds to the region R. 
21° (8912) 
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The area of the surface of revolution 
Z=uUucosv, y=usiny, z= d(u), 
which is produced when the curve 
a= g(a) 


is rotated about the z-axis, is 
A= anf u/{l + p’%(u) }du = an { ude, 
Uy So 


where s is the arc of the meridian curve z = ¢(x) (Vol. IT, p. 274; 
cf. also Vol. I, p. 285). 
The surface w,, of the unit sphere in n dimensions, 
By? + a2 + ve + 22 = ’ 
is given by 


Volumes. The volume bounded below by the region R and 
above by the surface S with the equation 


z =f (x, y ) 
is given by 
= d . 225). 
V=f ff, ydedy (p. 225) 
(for the sign see Chap. V, section 4, p. 380). 


If the surface S is closed and forms the whole boundary of 
the region V, the volume of this region is given by 


V=f f [dedyde = — f frdudy=—f fadyde=— f fydede 


(p. 387). 
In polar co-ordinates the same volume is given by 


Va=f f [ 7 sin@drdodg, 


where B is the region of r6$-space corresponding to the region V 
(p. 254). 
The volume of the surface of revolution 


c= ucosr, y= usin, z= o(u), 
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which is produced when the curve 
z= ¢(2) 
is rotated about the z-axis, is 
V=a7 is ‘ude 
(Vol. IT, p. 267; cf. also Vol. I, p. 285). 
The volume ¥, of the unit sphere in ” dimensions, 
@2 4-7 4-2. ft OQ = 4) 
is given by 


_ (vz) 
"= Fin + DA} (p. 304). 


The volume swept out by a moving plane area P of area A 
is 
f dn 
V=] A(t) — &, 
[405 


where dn/dt is the component of the velocity of the mean centre 
of P perpendicular to the plane of P (p. 295). 


12. CaLcULUS OF VARIATIONS 
The necessary and sufficient condition that the integral 
I(u) = f "F(a, u, u')da 
ZL 


shall be stationary is Euler’s equation 


d 
Ff, “uo dz F. uv 0, 
or 
Py! + Baw + Pow — Fy = 9 (p. 498). 
If F involves several functions (xz), u(x), ..., Un(z) and 


their derivatives, then a necessary and sufficient condition that 
the integral 


I(u) — [Fe UW, sees Un, ws, Ree} un) 
Xe 
shall be stationary is that w,..., u, satisfy the system of 
Euler’s equations 


F,,— = Py,=0 (6=1,...,0), — (p. 508). 
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If F depends on z, u(x), u’(x), u’’(x), Euler’s equation is 


d ad 
Fy, —7Z, Fu + 7 Fy =9 (p. 513). 
If f F(a, y, 2, %, ¥, 2)dt is to be made stationary subject to 


the subsidiary condition G(x, y, z)== 0, then a necessary con- 
dition is 


d 

gy fe Fe= AGe, 

d 

© F,~F,= 2G, 

dt v Vv | 
d 

at s— F, = G,, 


where A denotes Lagrange’s multiplier (p. 517). 


13. Anatytic Functions 


For definition, see p. 532. 
The necessary and sufficient condition that 


SQ) =f@et+ wy) = ux, y) + we, y) 


shall be analytic in a region # is that in R the Cauchy-Riemann 
differential equations 
Ug = Vy Uy = —0g 
hold (p. 532). 
Cauchy’s theorem: If f(t) is analytic in a simply-connected 
region R, then 


[fou=o 


if C is a closed curve in the interior of R (p. 539). 
Cauchy’s formula: Under the same condition as Cauchy’s 
theorem the formula 


ot—zZz 


holds if z is a point in the interior of C. 
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If f(z) is analytic in the interior and on the boundary of a 
circle | z — 2 | S R, it can be expanded in a power series 


SO) =f) + B gle — 4) 
which converges in the interior of the i Here 


f® (%) 


ANSWERS AND HINTS 


CHAPTER I 
§ 1, p. 12. 

3. Let the vectors joining O to the points P, Q, R, S be denoted by 
b,q.r.s. Then the vector from O to the centre of mass of the triangle POR 
is given by #(# + g + 7”), and (cf. Ex. 2) the vector joining O to the centre 
of mass of the tetrahedron by $##(2+9+7)+tS=HF+9+74+S); 
this expression is independent of the order in which the vertices are taken. 

4. A, A’,....@ are the final points of the vectors 42+ 9), 
ir + s),...,4(9+ 7), and the three lines AA’, BB’, CC’ all have the 
same mid point, the final point of the vector }(#+ 7+ %”-+ 5), which is 
the centre of mass of the tetrahedron. 


§ 2, p. 18. 


1. The distance is given by the length of the vector product of a unit 
vector lying along / and any vector joining P to a point A (6, d, fyin k 


1 |a%—-b yo—d Yo—d %—f |? |%—f %—6 |? 
verre | i : ° | + | ee } 


c e| le 
2. The shortest distance h between / and I’, two straight lines in space, 
is perpendicular to both / and U’, ie. is parallel to the vector product of 
two arbitrary vectors lying along J and /’ respectively. Also, the shortest 
distance between / and I’ is obtained by projecting a line joining any two 
points on / and ?’ on to the line A: 
, e 


1 , 
V {(ae’ = a’c)? + (ae’ aa a’e)? + (ce? c’e)*} b ie vd . d’ f- f 


+ 


a c e 


3. The left-hand side may be interpreted as the volume of a tetra- 
hedron. 


4. The length of the vector product of the vectors (wx, of, oy) and 
(x, y, 2): @V {(Bz— yy)® + (yz — a2)? + (ay — Bau)?}. 

6. It is sufficient to prove the statement for the case where the origin 
is inside the polygon, as the sum of the determinants is unaltered by trans- 


lation of the co-ordinate system. If the origin is inside the polygon, all the 
623 
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determinants have the same sign and give the areas of the triangles 
OP,P., OPPs, ..., OP aPy 
§ 3, p. 26. 

2. If we write —-d=dx —1, —e=ex —1, —f=f x ~1, the 
three equations may be regarded as three homogeneous equations in 
x, Y, —1; the necessary condition for the existence of a solution is therefore 


D=0. If D=0 and eg. [3 | + 0, then the third equation is 
an) 


a consequence of the first two, and the first two equations in x and y 
have a solution, as their determinant does not vanish. 


3. The lines intersect if the three equations 


at+b,=aqt+ 
At + by = ct + dy 
agt + bg = cat + dy 


for ¢, t have a solution (cf. Ex. 2). The condition is 


% & 4—b, 
G@, Cy dz— b,)=0. 
a, Cs dz— bg 


5. Subtract the last row of the determinant from the first three. 


§ 4, p. 37. 

1. (a) 0, (6) 2, (¢) 12, (d) (w@— oly — ez — ze + y + 2). 

2.a+c= 2. 

3. (a) Introduce the three vectors 2 = (a, b, c), y= (a’, ¥’, c’), 
z= (a”, b”, ce”). Then D= [xy]z. Now for any two vectors a@ and 6 
we have 
|[2@6}| S|a@||S| and|ab|S|a||S]. Hence DS |x| |y| | 2]. 

(b) If, and only if, the vectors represented by the columns of D 
are mutually orthogonal. 

4.ab+cd=0,08+8=8+@?=1. 

6. It is sufficient to show that there is one point (2, Yo, %) which re- 


mains on the same ray through the origin, i.e. that there are four quantities 
Xq» Yor Zo, A (the first three of which do not all vanish) such that the equations 


datg = Aq + by + Cz 
Yo = Ug + CYy + fay 
y= 9% + hyy + kr 


are satisfied. Now we have only to choose A so that the determinant of these 
three homogeneous equations in 2, Yo, % Vanishes; this gives an equation 
of the third degree in A, which can always be satisfied, as an equation of 
the third degree always has a real root. 
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7. af = }(1 + cosg)a— eV 28ng.y— HI — C08 @)2. 
y =EVv2sing.2+ cosp.y + $V 28ing.2 
2’ = }(cosp — la — 4V 28ing.y + F(c08 + Le 
9, By Ex. 1, p. 12, and the rule for the multiplication of determinants, 
the square of the determinant is equal to +1. 


CHAPTER II 
§§ 1, 2, p. 49. 
2. Hn + 1m + 2). 4. [5 2| > o 


5. (a) No. (6) No. (¢) No. (d) No. (e) Yes. ( f) No. (g) Yes. 
(k) No. 


§ 3, p. 58. 
1. Cf. Ex. 2, p. 49: $(n + 1)(n + 2). 
3. 62 + 2a+t+e+ k). 


§§ 4, 5, p. 77. 
2. We may take the origin at the vertex of the cone; its equation is 


then of the form vu = (2) X 


n-1 


2 
4. (2) Ore + 7 5. Ore + Gre 


6. Cf. p. 391. 


§ 6, p. 81. 
1. xy. 
2. Use Taylor's theorem, expressing f(2h, e~U2h) and f(0, 0) in terms 
of f and its first and second derivatives in (h, e—/*); add and divide by h*. 
) ~o Lad o Ma 
4.(2) & S (™ +) amyn: lzjtiyl<l 2 E2%, an 


n=0 m=0 n=O mao mint 


values of 2 and y. 


§ 7, p. 93. 
1. Use Taylor’s theorem to express the co-ordinates of a point on the 
curve in terms of f, g, h and their first and second derivatives in ¢; then 


apply Ex. 3, p. 19: 


a — f(t) f’(to) F(t) 
y — Glo) (to) (to) | = 0. 
z — h{to) h’(to) h’'(to) 


3. If » is the centre of the sphere, the expression A=V xls) — vy}? 
must be as stationary as possible, that is, A, A, A must vanish (the dots 
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denoting differentiation with respect to s). Using the relations 2? = 1, 
xX =0, we obtain the equations (y—-x)x=0, (y—-x)xX=1 
[~ 2%] 


(y— x)4%= 0. Hence we have y— x = 


5. Cf. Ex. 3 and also Ex. 5, p. 19. 
7. From the definitions sof E,. &, §; we have §,=%, **=1, 
b= X/k, &y = [E,Eel, + J/ E,?= 1/t. Obviously &, = kE,. To determine 


Ee &s, we calculate their components with respect to a rectangular co- 
ordinate system O&,, O&, O&. From the relations 


Ee= 1, Gs? = 1, 6,8, = 6,6, = 6,5, = 0 
we obtain by differentiation 
EsEi = —E,Es = 0, Es = 0; 
hence &, is perpendicular both to E, and to gs, and therefore 


Es = + V (Es E» =+ &,/. 


We define the sign of + so as to give gE = — &,/t. This implies that tr is 
positive or negative according as the screw defined by the motion of the 
osculating plane in the direction of increasing s is right-handed or left- 
handed. To prove the second formula, note that 


eb: = —E, b= —k, Ete = 0, E.E3 = —~€s8 = 1/*. 
8. Use Ex. 6 and Ex. 3: (a) hE, — #8, + ree (b) ” Es -+ be 


Tv 
9. 1/| t| = VE,” = 0; hence €, is a constant vector y, say; x4 = Ey 
= £,&s = 9, so that xy = const., where y is a fixed vector. That is, the 
curve lies in a fixed plane. 
10. (6) If the curve is given by 2 = f(t), y = g(t), z = A(t), the sur- 
face has the parametric equations 


% = f(t) + sf'(t) 

y = g(t) + ag(t) 

z= h(t) + eh'(t); 
az Gz Oz. 


=) =? =; in terms of the derivatives with ect to t 
apie Bat’ dady’ dy in ives with resp 


Appendix, § 1, p. 100. 


1. (a) As RB is closed, there is a point B in R whose distance from 
A is less than that of any other point in R. Let » be the normal to AB 
at B. Then no point C in RB lies on the same side of n as A; for otherwise 
not only B and C, but the whole segment BC, would belong to R, and on 
this segment there would be points nearer to A than B is. Hence the parallel 
to x through A cannot meet R, 
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(b) There is a sequence of points P,, P,, ..., not in R, converging 
to P. Letl,,l,,... be straight lines passing through P,, P,, ... respectively 
and dividing the plane into two half-planes, one of which contains no 
point of R (cf. (z)). From these straight lines we can choose a sub-sequence 
for which the directions also converge. The limiting straight line is then 
a line of support through P. 

(c) If A were not in R, then by the proof of (a) a line of support 
separating A from R would exist. 

(2) Let G be the centre of mass of R and g any line of support, which 
we take as x-axis. Then the y-co-ordinates of all points in # have the 
same sign. By the definition of the centre of mass (cf. Vol. I, p. 284), the 
y-co-ordinate of G also has this sign; that is, G and R are on the same side 
of the arbitrary line of support. Now apply (c). 

(f) The curvature is equal to dp/ds, where ¢ is the angle which the 
tangent makes with the z-axis, s the length of arc; 9 is a continuous func- 
tion of s. Hence 9 increases monotonically from 9(0) to 9(0) + 2x; that 
is, p cannot have the same value for two different points of the curve. 

If the curve were cut at three points %, 8, 8 by a straight line / 
(ax + by = c), then the function 


F(a) = az(s) + byl) — ¢ 


would have three zeros; in this case F’(s) would also have at least three 
zeros, i.e. there would be three tangents parallel to 1. In addition, two of 
these would certainly have the same sense, i.e. they would have the same 
value of ~, which contradicts the statement above. 


2. (a) The set consisting of the points which lie in all convex regions 
containing S has the properties (1), (2), (3). 

(b) If P is in EZ, there can be no straight line 7 separating P from S; 
for otherwise one could take e.g. a large square Q with one side on Z and 
containing 8; @ would then be a convex region containing S but not P. 

If P is not in Z, there is at least one convex region Q containing S but 
not P; then (cf. Ex. 1(a)) there is a straight line separating Q from P, and 
therefore, as Q contains S, also separating S from P. 

(c) Cf. Ex. 1(d). 


Appendix, § 2 (p. 107). 
1. (a) No. (b) No. (ce) Yes (cf. Vol. I, p. 436). 


CHAPTER Hi 
§ 1, p. 122. 
5 . a ag . —tI]. —_ 
2. (a) ae (6) 3” (c) 1; (@) 1. 
21 spite cake 
3. (a) ~ 39” (b) Ts (c) 2; (d) 3 . 


628 ANSWERS AND HINTS 


4, Max. value +6, min. value —6. 
5. d/de = —1, d/dy =—1. 


§ 2, p. 131. 


1. (a) Be + Ty — 212+9=0; (b) 20x + 13y + 3z = 36; 
(c) =—y—z+n/6=0. 
2.1. 
3. Use the fact that the tangents at the origin are given by y = 0 and | 
az-+ by=0. 2c/a, 2(a8g — a*bf + abe — b8c)/a(a® + b*)*2, 
4. Write equation in form 0 = F = f(x? + y’, arc tan y/z): 


Qy/2 — py! 4. 7% af » _@f 
= TN r’ =>, r’ = « 
(iy apa? Where’ = 7 dot 
6. xy + z) = ay. 


8. (a) Double point. 
(6) Two branches touching one another. 
(c) Corner, 
(2) Cusp. 
(e) Cusp. 
9. Differentiate the equation F = 0 twice with respect to x and use 
the fact that F, = 0. 
@ = aro tan 2°/ Fay? — Poe Fy! (Foe + Fyy)s (@) 7/23 (6) 10/2. 
10.a=1, b= —}. 
12, The circles K, K’, K” may be denoted by the equations 


K=P+yY+axr+by+e=0, 
KR =2+yY+aze+by+e=0, 
KV’ = e+ Pt a et byte = 0. 


Then any circle passing through A and B is given by K’+ »K” = 0. 
The conditions that the circle K should be orthogonal to K’ and K” are 
aa’ +- bb’ — (e+ c’) = 0, aa” + bb” — 2c + ce”) = 0. From these con- 
ditions the corresponding relation expressing the orthogonality of K and 
K’ + 2K” readily follows. 


13. = 5 ates 


§ 3, p. 187. 
2, oj Me 
Az,y) (w+y?? 
3. Take O as the origin and invert; then the curvilinear triangle is 
transformed into an ordinary triangle with the same angles. - 


5. (b) If we denote the left-hand side of the equation defining #, and 
ty by F(x, y, t), two curves ¢, = const. and ¢, = const. are given implicitly 
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by the equations F(x, y, ¢,)= 1 and F(a, y, &)= 1 respectively. The 
condition that these should be orthogonal is therefore 


O= F,(a, y, 4) Fula, y, te) + Fra, ys HF A% Y, t2) 


= 2 + SS 
(@—t)a—%)  (6—4)b— 4) 


but this relation is an immediate consequence of F(z, y, t,) ~ F(x, y, t.) = 9. 

(c) The coefficients of the quadratic equation defining ¢, and ¢ are 
respectively equal to ¢,, #,, and —(t, + #,). We thus obtain two linear equa- 
tions in 2? and y?, whence 


en en 
a tote) dory(a — b) ; 
x,y) Vila + 6)? — 2a — bya — y*) + (2 + y*)?} 
(e) fio’ Sage 


(@—t)o—4) @—%)o—&) 


6. (a) Let F(é) be the left-hand side of the equation defining t. F is 
@ continuous function of ¢ in —«o <t<ce, for which F(—o)= 0, 
F(e— 0)= +0; hence F=1 at one point at least of that interval. 
Similar conclusions apply to the other intervals. 

(6) Cf. Ex. 5(6). 

CE Ex. oe): 2x +, M@— aa — Ae ~ fb) 

(0) Of. Ex. 5(0): @= 4/2 
with similar formule for y and z. 

7. (b) Let «= rcos0, y=rsin®. Then the straight line 6 = const. 
is transformed into the conic t, = $— cos?@ and the circle r = const. 


into the conic f, = -i(" ES a) 


9. a(u, 2 + y?) q|“ “vio, 
ax, y) zy 
§ 4, p. 167. : 
1. (6) A circle on the sphere is given by a linear equation in 2, y, z. 
dut 
(d) det = 4 a? + de 


2. (a) ds? = sin*v du? + dv’; 
(5) ds* = cosh?v du* + (1 -+ 2 sinh?) dv; 
(c) ds? = (1 + f)dz8 + og 


(t) — 6)(4 — (t. — t,)(ta — ts) 
d) dst 1 A Pett St" 
= Ge meat is—0—De—w) 
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2 
2, 4, . 
nits ; use the transformation 


%y Ly 


2 
Yu 2u 


3. EG—- P= 
« %y 


formula for Jacobians, 


®y Yu 
z 


+ 


& Ft 


4. Introduce co-ordinates 2, y, z such that P becomes the origin, the 
tangent plane at P the ay-plane, and ¢ the x-axis. The equation of 8 then 
takes the form z= f(x, y), where f(0, 0) = f,(0, 0) = f,(0, 0)=0. A 
plane & through ¢ is given by the equation z= ay. We now introduce 
r= 4/(y*?-+ 2) and x as co-ordinates in Z; then the intersection of D and 
8 is given implicitly by the equation 


re 


Tears Hed bi Viral 


The curvature of the curve of intersection at the point z= 0,r= 0 is 
therefore (cf. p. 125) given by 


2 
k= fom ware) 


Thus the centre of curvature of this section has the co-ordinates 
\ 
1 a2 


= = bed = ee = . 
RVI + a) fg(1+ 02) : EVQ+@) fe(l+ a) 


z=0, y 


that is, it lies on the circle 
tral? + 2)—z=—9. 


5. Take the tangent plane at P as the zy-plane. Then the equation 
of 8 may be taken to be z= f(z, y). A normal plane is given by the 
equation = ay. Take r = v (a? + y?) and z as co-ordinates in the plane; 
then the curve of intersection is given by 


Z== f{ es ce \ 
V(l+ a) V+ a) 
and its curvature at r= 0 by 
1 
1+ a2 


b= fas(0, 0) 5 + 2fay(0, 0) Tai tal) 


+ « 
the final point of the vector of length 1/Vk along the line ¢ then has the 
co-ordinates 


a 1 1 1 
es Vite) VEY Va pat) VE 


z= 0; 


that is, it lies on the conic 


foo + 2Y fay + ¥fyy = 1. 
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6. (a) By differentiating the two equations with respect to a para- 
meter ¢ of the curve, we obtain 


ax! + yy! + 22’ = 0, axx’ + byy’ + czz’ = 0. 


From these relations we can find the ratio 2’: y’: 2’, i.e. the direction of 
the tangent. If (&, y, ¢) are current co-ordinates, the equations of the 
tangent are 

_¢—6 a—c b—a 


(6 — 2): (qy— y): (0 — 2) = —— : —— : _—.. 
xz 


(b) By differentiating the equations of the curve a second time and 
using the result of (a), we obtain 


(a—c)? (b—a)? 
y? + Za 


—b)* 
ze" + wy’ +e (+ yt tea { ES ) = 


and 


- Je wo a(c = 6)? b(a _ c)? c(b _— a)* 
axx”’ + byy” + czz =1{ = + 7 + a \ 
where ) is a factor of proportionality. Eliminating 4, we have 


— 2 — ce) = 2 
(am’” oe yy” + za’) (“5" + “ ue c(b =) 


(e— oF , (a—e? , b—ay 

== (ax + byy” + caz’ ee + 7 we ae \ 
This linear equation in z’’, y’’, 2” remains valid if we substitute 2’, y’, 2’ 
for «”, y”, 2’. Hence it is still satisfied if we replace x”, y”, 2” by some 
linear combination Az’ + yx", Ay’ + uy”, 22” + uz” respectively. Now 
if (, 7, ¢) is in the osculating plane, § — x, y— y, © — z are just such 
a linear combination (cf. Ex. 6, p. 94). 

The equation of the osculating plane is hence found to be 
ax by? cz8 
pee, hes a Gop ett UO 


§ 5, p. 182. 


1. Let P(x, y, z) be a point on the tube-surface Z, and let S be the 
sphere of the family which has the point P in common with &. Then 8 
and © have the same tangent plane at P, i.e. the same values of x, y, 2, 
2,» %, at that point. It is therefore sufficient to prove that the relation is 
true for any sphere of unit radius which has its centre in the zy-plane, 
ie. for u(a, y) = Vl — (2 — a)? + (gy — 5}. 


2. (a) Ve + Vy 4+-V2=1; (b) c+ yi t+ 2t= 1 


4. We may introduce ¢ as parameter on the curve, so that the latter is 
given by x= 2(é), y= y(t), == a4) and the tangent at the point with 


632 ANSWERS AND HINTS 


parameter # lies in the two planes corresponding to #; this gives the re- 
lations 
ax’ +- by’ + cz’ =0, da’ + ey’ + fe = 0. 


By differentiating the equations of the straight lines with respect to ¢, 
we thus obtain 
we+by+cz=0, d’et+ey+fz=0. 
With the relation 
az + by + cz = dx+ ey+ fz 


we then have three homogeneous equations in x, y, 2, and the determinant 
must vanish. 


5. For the envelope we have the two equations 
wcost-+ysint+z=8 
—xsint + ycost= 1, 
These two equations give a family of straight lines with parameter ¢; if 
a curve having these lines as tangents exists, it must also satisfy the 
equations obtained by differentiating once again. 


(a) rsin{z+ V(r7—1)— 0}+1=0; () the curve is given by 
z= 6—7/2,r=1. 


7. Use inversion. Since 8,, S,, S, pass through the origin, they are 
transformed into planes; we have then merely to find the envelope of 
the spheres touching three planes, i.e. a certain circular cone, which we 
reinvert: 


(+ yt + ait Oat + yt + Me + y + 2) 
— 3a? + y? + 24 — ay — Qaz — Qyz) = 0. 
8. (b) EX1 — at) + 9X1 — b%) — 2abEn + 2ak + 2 = 1; 
(c) a&3 + 6772 = 1, 
§ 6, p. 202. 

1. (44V5)/V2, (4—V5)/V9. 
2. a/20, a/10, a/10. 
3. Maxima for x = 0, y= +1, minimum for 2 = y= 0. 


4. The maximum value is the same as for the expression ax? +- 2b2y + cy? 
subject to the subsidiary condition ex* + 2fzy + gy? = 1. 


5. Of. Ex. 4. (a) e+ 2v er, 


(b) the function has an improper maximum (p. 184) equal to 1-95, 
when y/z = 0-64, 


ANSWERS AND HINTS 633 


6. Saddle points: y = 0, z= 1/3, 7x/8, 1377/3, ..- « 
Minima: y = 0, z= 5n/3, 11n/3, 177/3,..- - 
7. The ellipse obviously touches the circle, i.e. the two equations 


must give a double root in 2; hence the condition for contact is 
a%(b? — 1) = bs a = 3/V2,b = V (3/2). 


8. Introduce the angles between a, b and c, d as variables: the cyclic 
quadrilateral. 
9, (—1/V 14, —2/V 14, —3/v 14). 


10. Ct. the similar proof for triangles on p. 187. A minimum point 
O does exist. First show that if O is not one of the vertices, then it can 
only be the point of intersection of the diagonals. Use the fact that the 
final points of four unit vectors whose vector sum is zero form a rectangle. 
Then prove that the sum of the distances from the vertices is less for the 
point of intersection of the diagonals than for any of the vertices. 


11. A = a?/a, B= b/y, C = c*/z, together with the subsidiary con- 
dition 
2 yf a 
atata” 
at ai 
O2= apap M*=TeyoF a ™ 

12. The vertices are given by = +a/V3,y= +6/V3,2= +te/V3. 
13. The vertices are given by z = a#/V (a? + 07), y= b2/V (a? + 5). 
14.<=l1y=1. 


15. The greatest axis is given by the maximum of V (2% + y* + 24), 
with the subsidiary condition that (x, y, z) lies on the ellipsoid. Hence 
we have the three equations 


x 


xz 
Verypr?) i* Naz + dy + ez), &eo. 


Multiplying these by (%, y, 2) respectively and adding, we have 
A= V(t y2 + 24) = 1. On the other hand, we may regard the 
equations as three linear homogeneous equations in 2% y, 2, whose 
determinant must vanish. 


Appendix, p. 208. 


1. f(z) + f(y) + Ff = 3f(@) 
+ {(@— a) + (y— a) + (2— a)}f'(a) + BeLI(a) + €}, 


where p= (x — a)®? + (y— a? + (2 — a). On the other hand, the 
subsidiary condition gives 
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(g—a)+ (y—a) + @—a)= (- 2°) ea ‘) 


2¢’(a) 
~ Fs {(e — ayy — a) + (7 — a)(z— a) + (y — ale ~ a} 

{| #@ , ¥@) 

= {~ 35 * zat *} 
where lim « = 0. 

hYt—>ae 
CHAPTER IV 

§ 1, p. 222. 


1. F=0Ofory > 0. 
2. Use the relation 


1 
5 a cos@ + f, sin 9) ee sin? — 2f,,, sin p cos 9 + fay 008" 9 
+: oe i Mle singe — f, cos). 


3. Integrate u,,, by parts twice (special precautions necessary in the 
case where p < 5/2). 


4. Integrate Jy’ by parts. 


§ 2, 3, p. 247. 
1. 1/24. 2. 0. 3.0. 


4. 7/8 if region of integration is restricted by the condition z > 0; 
otherwise zero. 


5. 1/50400. 6. 7(2 — 3 log3). 


7. Introduce polar co-ordinates and integrate first with respect to 
p and 6: (2+ 3 log3). 


8. 4log(1 + V2). 

9. Divide up the interval of integration into the segments 
—lsas—Wh, —-Wh Sash, Wh Sa SL, and find the limits of 
the integrals along each segment. 

§ 4, p. 255. 
1. Apply the substitution 7+ y= & 2—y=m = (c —-~-)}). 
1 
2. Introduce polar co-ordinates: (a) was =, (b) ue arc tan -. 
4 2 2 2 
3. Substitute « = af, y = by, z = cl: Zarb%c%, 
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7. Introduce new rectangular co-ordinates (&, 7’, ¢) such that 
B= (E+ yn+20)/r. Then d&dndt = dt/dx/ dt (cf. Ex. 9, p. 38), and 


i ij i) f cos (rE’) de’ day dt? 


throughout the sphere &?# + 7+ ¢? <1. Hence, if we perform the 
integrations with respect to 1’ and (7, 


I=n 4 vt eos(ré’)(1 — &2)d8’. 


4 ; 
Answer, = = _ cosr where 7? = a? + y? + 23. 


8. Substitute & = (z&+ yn)/r, a = —(YE+ zn)/r, and integrate 
with respect to 7. 


§ 6, p. 275. 


1. Apply Guldin’s rule, using the fact that the centre of an ellipse is 
also its centre of mass: 27?ab. 


2. mabh?/2. 
3. Substitute x = a&, y= by, z= ct 
1 Eee Ee es eee 
3 re ( ~ J (a2l2 + bm? + =a) @ + V (al? + b2m? +- =m) 
4. (a) Compare corresponding elements of area. 
2 
() at [ (1 — conf(e)}d9s (0) Bx( — 420% 
0 


ar tanhe 


5. ona*{1 + (1 — e?) \ where 2a is the major axis. 


é 


6. Volume = Ancp*, surface area = m(a + b)p, where a, 6, ¢ are the 
sides of the triangle and p the perpendicular from C to AB. 


7, From the differential equation (cf. Ex. 1, p. 182) satisfied by the 
tube-surface u = u(x, y) we have 


A= af fva ug + uy) dedy = 2f fea. 


u 


If we introduce as parameters the length of are s on L and the distance ¢ 
along the normal to L, then (cf. Ex. 22, Vol. I, p. 291, and Ex. 3, Vol. II, 


p. 182) 
+1 (1 + Kt) 
A=2fasf a” w= an ds, 
ep Vasey ds 
where k denotes the curvature of L. 


8. Integrate first with respect to x and y: (a) 16r2/9; (6) 87%. 
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rt R+V(R? + h®) 
9.5 {Rv (Re +) VC + WO) + Wi log I. 
10. Introduce polar co-ordinates: 1*/2. 


§ 7, p. 285. 


1. On the axis of the cone, two-thirds of the way from the vertex to 
the centre of the base. 


2. a= 2n,/38, y= z= 0. 

3. Cf. Ex. 7, p. 275, and Ex. 1, p. 182. 

4. (a) wh(R4— BR’); (b) 2xh(R? — R*){4(R? + RA) 4 fh}. 

5. For example, 4+ B—C= 2 f { f uz" dadydz, which is positive. 


6. Substitute «= at, y= by, z= cl; use the expressions for the 
moments of inertia given in the text and the properties of symmetry of 
the ellipsoid: 


(a) Zz nabo(at + 08); (0) A reabo{(A— o8)a* + (1— BA -+ (1 —y"ep. 


7. The distance of the point (x, y, z) from the plane uz + vy + wz = —1 
is given by 
uz + vy+ wet 
Veet + wil” 


The moment of inertia of the ellipsoid with respect to this plane is there- 
fore given by 
Av+ Bi + Cw+ V 
w+ 2+ wi : 


where A, B, C denote the moments of inertia with respect to the co- 
ordinate planes and V is the volume of the ellipsoid, ie. A = 4a%be/15, 
B= 4ab*c/15, C = 4abe3/15, and V = 4abe/3. We have now to find the 
envelope of the planes for which this expression is equal to h. The envelope 
is given by the equations 


(A — hju = da, (B— h)v = dy, (C — hw = 2, 


where A denotes a common multiplier, which from the expression for the 
moment of inertia and the equation of the plane is found to be V. By 
squaring the three equations we obtain the equation of the envelope, 
namely, 
a ¥ za 1 
hoa SO ao 


9. a%(x— E+ By — 9) + Me — 0) 
= {ab + 4 ott 5+ oP + OS (e— EP + (y— att (2-0) 
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5a 2a? + B+ 
16 a+ 8+" 


2a’ a nL 
13. —.—; Veo a) C+ - 1)}. 
14. Integrate firat with respect to a and y: 
5 ,———______ 
anf VF Ged — 2/0 F atl, 
Pas 


10. (4, 0, 0). ll. a= 


where the upper or lower sign is to be taken according as the origin is 
inside the body or not. 


Appendix, § 2, p. 298. 


1. § consists of unit circles orthogonal to C and having their centres 
on C (cf, p. 295). 


Appendix, § 3, p. 307. 
(Vay 


Eee) 
2. By p. 301, 


fe SIS %) 
I= f- il fase ape 


—a22%—. 


1. Substitute x, = 4,8, ..., %= Ag oo Oye 


taken throughout the interior of the (x — 1)-dimensional unit sphere in 
Z_+++%,-space. Introducing polar co-ordinates, we obtain 


=f" ar f VIP) t+H-VI=*) a, 
0 Fe) 1—r 


where S(r) denotes the sphere of radius r and centre Oin 2... Z,-space. 
As the integrand depends on 7 only, 


I= f VERA OVE pana 
= On-1 . 
) 1—r 
Putting y= 1/1 — 1°, we have 
n—-38 


+1 n—8 
T=@y-1 Fy — y*) ? dy. 
-1 


Appendix, § 4, p. 317. 


1. Put I,(a) = if at%e—e"de; then I,,(a) = —I%_2(a), where dashes 
denote cares with respect toa. Alternatively, integrate by parts. 


1.3.°°°(n—1) ‘ 
RE when n is even. 


: 
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2. Substitute & = ax + By, n= yx + dy, where «, 8B, y, 8 are chosen 
so that 
B+ 4? = ax? + Qhay + cy* 


Then («3 — By)? = ac — 6, and the integral is transformed into 


] eo feo} 
cae Te RR 
—a/ —w 
ac—B=7n,a>0. 


3. Make the same substitution as in Ex. 2 and evaluate the resulting 
integrals, (a) using the result of Ex. 1, (b) introducing polar co-ordinates. 


mal + cA — 20B) 2x 


Oey! CO rae 


4. (a) Forming K‘(a), where the dash denotes differentiation with 
respect to a, and integrating by parts twice (taking xe—%** as one factor), 
we have K’(a) = —K(a)/2a + K(a)/4a%, i.e. 


ae at 
K(a)= Ca %e %&, 


20 t 
here C is given by C = lin+/a K(a) = lim -# ~, db= Fe 
where gi y = tim Va (a) i e-F cos 5, han 


7 a>aorod 
K(a) = r= e *, 


on 
(b) Integrate the formula = a e—* cosxdx with respect to t 
from a to b. 6 
1+ a? 
log ———... 
B08 


(c) Substituting z= 1/t in the expression for I’(a), prove that I’ = 
—2I, ice. 
1 = Ce—?a, 


co 
where O = lim J = vi e-* dx, 
a—>0o Q 


3/ re—2a, 


(2) Substitute the integral expression for J, and change the order of 
integration. Use the formula 2 sinax cosbat = sin(a +- bt)x +- sin (a — bt)2; 


Bee 
ef. the expression for aa dy on pp. 307-8. 
6 


m/2 when a > b, arc sina/b when a < 6. 
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or 
f(x) = const. =¢, g(x) = —e for +20. 


For ¢ = 0, moreover, 
p(t) = (0, t) = f(—at) + gat) = f(—at) -—6 
that is, f(Z) = 0+ (+) if £ <0. Asa-+ at 20 always, and hence 
—a 
g(% + at) = —c, it follows that 


0 for x —- at 20 
u(x, t) = {o(2=") for z— af SO 


—a 
if both x and t are non-negative. 
5. Lf u(x, y) = La,,2’y4, then 


a, 1 +1 > ___ Wes 
mer WF Dut iy 


in addition, 
4,09 =%,=90 for v21 and m=. 
Hence 


~ a” Vv id 
ue, y)= EE = J aiV/ay, 
yo V! 
where J, is the Bessel function of Ex. 4, p. 223. 


6. (a) From the differential equation we get 
(fm)? + Oy)? = 1 


(f(x)? = 1 ~ (9'y)"). 


As the left-hand side does not depend on y, nor the right-hand side on 
x, both sides are equal to a constant (which has to be positive or zero), say 


c*; that is, 
(f@y=He, 1-Gy)yp=e. 


w= cx-+ /(1— chy tb 


is a solution, where c and b are arbitrary and c? S 1. 


or 


Hence 


(b) «% = f(z) + gy) gives 


f(z) = een ee a 


gy) 
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so that 
a=martiy+o 
a 


(where a and b are constants). 
If w = f(x)g(y), then 


Z 2 me a 2 = ° 
5 Very = 4] 5 (ou) = const. = 20 
so in this case 
2 
w= V { (20 + a)(2 y+ v)}, 
where a, 6, c are arbitrary constants. 


7. A one-parameter family is obtained from the two-parameter family 
of solutions z= u(2, y, a, b) by making a and 6 depend in some way on 
a parameter é 

a= fit), b= g(t), 
z= ula, y» f(t) g(t). 
The envelope of this one-parameter family is obtained by finding ¢ from the 


equation 
O= = Ugh + mg’, 


and substituting this expression for tin z = u(a, y, f(t), g(t)). The result is 
again a solution of F(x, y, 2, 2m, 2) = 0, as 
z= ua, y, a, b) 
Zy = Ug + Uzty = Ug(%, y, a, 5) 
By = Uy + Uyzty = u,(x, y, a, 6) 
and z= u(x, y, @, b) satisfies the equation F(x, y, z, z, z,) = 0. 


8. side ah +k af y 
sais Pay y oe gtk 


CHAPTER VIL 
§ 1, p. 497. 


1 2 Ke — Xo) + (yy — 40)? 
; V2 ¥— Yo 
2. t= [soa + 786? 9 sin? ¢* do. 


§ 2, p. 505. 


od 
1. Parabolas y = ¢? + re 
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2. Circle with centre on z-axis, 


xwr—-a 


3.y=csin 


4. y= qeit bforn > Land y= alogz +b forn=1. 
5. y = aa — by@+™ ifn+m+0; y = ae ifn = —m, 
6. ay” + a’y’ + (b — cly=0 


a b 
(for b = const., iM byy! dx = 5 (ys® — 91°) 


only depends on the end-points of the curve y = y(z)). 
fd 
7.91 — Yo < 3 


8 Consider F(x, y) for fixed x as a function of y; let this function of y 
have a minimum for y= g. Then F(a, y) 2 F(x, 9) for a certain neigh- 
bourhood of 7 and F,(x, 7) = 0. g will depend on the parameter 2; i.e. 
y = 9(x). Then for any neighbourizig function y we have 


ay ay 
F(x, y(x))de = "E F(x, 9x) da, 
Xo 


Xo 
where §(x) satisfies the equation F,(2, 7(x)) = 0. 
9. (a) y= 0. 
(b) Use Schwarz’s inequality. For any admissible , 


1= y(1) — (0) = fs vars af ( f ‘Yax)a/( uf yrde) = VE 


and the equality sign holds for y = x. 


§ 3, p. 510. 
1. Ifv = 1/f(r), then T is given by Ex. 2, p. 497: 
P= f(y + POF sino &). 
Euler’s equation for the variable » gives 


of272 sin 0 
7p = LOSES — const. = 0 


along a ray. Now let the polar co-ordinates be chosen in such a way that 
the plane » = 0 passes through the initial point and the end-point; 
since = 0 at both these points we have 9 = 0 for some intermediate 
point, by the mean value theorem, that is, C = 0; but then ~ = 0 for the 
whole ray, ic. p = 0. Hence the whole ray must lie in the plane 9 = 0. 
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§ 3, p. 518, 


The law of conservation of energy gives 
T+U=T=-; 3 (SY = const. =50% 


ds 
hence qi = const. = C = initial velocity. 


Then Hamilton’s principle asserts the erga ee of 
f (1 — U)dt = f “nat = 50 a= 50 aa 


stationary character of Hamilton’s integral implies that the length of 
path is stationary. 


Miscellaneous Examples VII, p. 520. 
1. From the differential equations for geodesics (p. 518) we find that 


dz 
for a cylinder, ie. if G does not depend on z, a= const.; hence the 
geodesics on a cylinder make a constant angle with the ay-plane. 


2. (a) g(a) ~ iter =0. 
6y’( 119 + 4y’ ueny Qt 48 yA" 

0) 0) — et Rt eee” 

(yty’+y™. 

(2) (2 — yy” = 0. 
3. (a) od = (ay + dy) y+ (Dg + Cy) Py + Pre + 2WPny + CPyye 
(8) A%p = 0. 
(c) A% = 0. 
au’ + a’u’ + u(b’— oc) 
eens 


4. 4 = const. 


5. (a) Euler’s equation gives 
f+ 2su = 0; 
1 
from this equation and if q’dx = K*, we have 


eed VS faz) KS 
a) a MS fay 


(6) For any continuous admissible @ we have 


I = [ foae < V(fre) Vf’) =K Vf re), 


the equality sign holding for o = wu. 
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CHAPTER VIII 
§ 1, p. 529. 
1. Forz 20. 
2. Use the principle of comparison. 


3. The coefficient of 2” in the expansion of cos?z + sin?z for n > 0 is 


am (1  (-19 Bn _ 
© i. vi(n — v)! a nt 2 od) (") oaks 


(cf. Vol. I, p. 28, Ex. 2(b)). 
4. The series is convergent if, and only if,| z| <1. Forif|z] = 6 < 1, 

then 

a 


a 
ae 2 So 6” 


=i-y*i-e 


v 
and we may compare with the geometric series. If |z|> 1, then <> 
tends to —-1 ag v increases, whereas in a convergent series the terms must 
tend to 0. If |z |= 1, either there are terms in the series which are not 
defined, or at least its terms are not bounded, since z” may approach I as 
closely as we please. 


§ 2, p. 535. 


Let f(z) = u + iv, 9(2) = w’ + iv’, fo = p + ig, where p = wu’ — wv’, 
q = uv’ + vu’. Assume that u, v and also w’, v’ satisfy the Cauchy-Riemann 
equations, and prove that the same is true of :, q. 


§ 2, p. 536. 


1. The functions (a), (5), (c) are continuous everywhere; (d) is discon- 
tinuous at z = 0. 


2. None. 
3.(C/?= t= ace + BB + (afe + 282) 
: OBze + aw + (aBe + az) 
Now for ax —~ 8B = 1 the difference between the numerator and the 
denominator is 
a— 1, 


so that the numerator is greater than the denominator for | z| > 1, and 
smaller for {z{<1. If 88 — «% = 1, the converse is the case. 
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4. If z= relt, € = & + ty, then 


lf r = const. = c, then 


3 
cee 


e2 
if aaa iyi 
i(°+ 5) 4\° «) 
if p = const. = c, then 


e2 1? F 
cose | coste—1 


(ef. Ex. 5, p. 158). 


6. First transform, by putting C = az + 8, into the unit circle; then 
1 
apply the transformation ¢/ = rons 


7. The equation of a circle or straight line in the ¢-plane is of the form 
att + BC+ BC+ y=, 


where and y are real; if we here substitute the expression for ¢, we get 
an equation of the same form for z. 
For a fixed point ¢ = 2 we have the quadratic equation 


cz? -+ dz —-az—b=0, 


which has in general two different solutions. A circle through the fixed 
points is, as we have just shown, transformed into a circle and must again 
pass through the fixed points; the family of orthogonal circles transforms 
into itself, because circles become circles and the transformation is con- 
formal. 


§ 3, p. 545. 
2. The series is absolutely convergent, by Vol. I, p. 382. 


§ 3, p. 551. 


1. By the Cauchy-Riemann equations the partial derivatives v, and v, 
of v are given; a function v with these derivatives does exist, since the con- 
dition of integrability u,,,-+ uy, = 0 is satisfied (cf. p. 353); v is uniquely 
determined, apart from an additive constant ¢, and is given by the curvi- 
linear integral 


(x,y) 
u(x, y) = (vydy + v,dx) +c. 


(Zo, Yo) 
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It also follows from the Cauchy-Riemann equations that v is a potential 
function. 
§ 4, p. 553. 

1. It is easily seen that 


_1 io 2 
We = sf eae 


is an analytic function of z. By differentiating under the integral sign and 
using Leibnitz’s rule (cf. Vol. I, p. 202), we find that ni)(z) is 


1 £ (v = = f(S)  anwety 
ii 2, (C) unt 1)... (0 etvt Df erin - 
— pe MK n f(%) gn—pty 
Qni aC) (Cagyrrt oO oe 


Only the terms with »—v <n differ from zero, as otherwise ee i ) 


vanishes. On the other hand, a term with up — v < 7 vanishes for z = 0; 
if u <n, there are no other terms, so that h©)(0) = 0. If pu 2x, there 
remains only the term with y— v= 7, so that 


HH(0) = & f coe dt = f\(0). 


t-— zjyeti 
1 ¢ fO 
alae lea fen 


radius p about the origin. 


< = a 5 20, where C is the circle of 


3. f a dz is equal to the sum of the residues of f in the interior of C. 
oa 


Now if f has a zero of order 7 at z = 2p; 


f(z) = (& — 2)" 9(2), 


f@ _ nee) + (2 — a0)9"@) 
i) (2—~)9(z) ” 


where (2p) + 0; 


f) 
f@ 


4, (2) The number of roots of the equation P(z) + 6Q(z) = 0, by 
Ex. 3, is 


at z == Z is 2min. 


so the residue of 


1 rP(z) + 8Q%2) de 
QniJdy Plz) + 9Q2) 


The denominator differs from zero for every @ for which 0 <6 1 at any 
point of C; the whole integral is therefore a continuous function of 6. 
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As its value is always an integer, it is constant, and hence the same for 
6=0 and 6=1. 


@) If|a}<r—*, thenr > 1; so the equation 2° + 1 = 0 has five 


roots inside the circle |z| = 1; if we put P(z)= 25+ 1, Q(z) = az we 
have on the circle | z | = 7, 


|Q2@)|=|a]r<rf—1<|/2+41|= [P(e]. 
5. Cf. the proof of Ex. 3. 
§ 5, 3, p. 559. 
1. The left-hand side of the formula is the sum of the residues of the 
1 ak 
function 2*/f(z), and is therefore equal to Omi f jo dz round a circle 


enclosing all the roots «,. But this integral tends to zero as the radius of 
the circle tends to infinity (the centre remaining fixed), 


Miscellaneous Examples VIII (p. 567). 


1, 2% must be real. 
%_ — 2 
2A= an [3 *1— *4 must be real. For if 0 is the circle through 
2" “3 


By — By oz + p 
yy %, 23, we may transform C by a linear transformation ¢ = 


ye+s 
into the real axis (cf. Ex. 6, p. 587); by Ex. 5, p. 587, A is ighenges. 
then a necessary condition that the image of z, shall lie on the same circle as 
the images of 2,, 2g, 2, is that it is real, which is equivalent to A being real. 


3. The equality to be proved is 


V la — 2] [2s — %| + V [a — 25| | 2) — | 
=V | 4% — 2 | | 2 — %4 | 
D+ 4/ 


or 
(2 — %)(% — %) — %)(23 — 2) 
(@_ — %)%1 — 2a) — %)(% — %4) 


(2 = #s)(@a — 4) |, 
(%_ — 2)(2, — 24) 


= 


Now the expressions under the square roots are invariant in a linear trans- 
formation (cf. Ex. 5, 6, p. 537). If by a suitable linear transformation we 
transform the circle into the real axis, we have only to prove the relation 
AB.CD+ BC.AD= AC. BD for four points on a straight line, where 
it is trivial. 


4. € = ef takes every value except € = 0, as is easily seen from the 
telation e#* = e-¥(cosz + isinz). Now we have to choose ¢ so that 


o= cose = 5 (% + 5): 
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this quadratic equation always has a solution 
S=cive—-l, 


and this solution is not zero, so that a corresponding z exists, 


5. Cf. Ex. 4. If 0 = e*, then 


1 
ie 
tanz = - ——- =¢ 
te 1 
4 
or 
i+t 
c= 1l— ie’ 


there is a finite ¢ + 0 only when c+ +1; hence tanz =e only has a 
solution if ¢ is neither -+-+ nor —7. 


6. If z= 2-4 wy, cosz is real if x= mn or y= 0, and sinz = 0 if 
a= mn + 5 or y = 0 (where n is an integer). 
7. (a) r= 1 (for |z|> 1 the individual terms tend to o, for | z| <1 


compare with the geometric series). 
(b) r= 0. (ce) r=1. 


8. Cf. Vol. I, p. 175. 


eft 
9. (a) Integrate ine 


over upper semicircle: 


Ze rtd 
(b) Integrate ——— ize over upper semicircle: 
v2 
nv? 2 (cos V2 — cin V2), 
4 2 2 
(c) Integrate 2 - “ap a OVED Upper semicircle: s 4, 
gaat 
d) Inte, es aa ion bounded b 1 i 
Nee aaa) ee eee 
about the origin and slit along the positive real axis: ——__——— 
ste 


10. (a) +2ri at z= 2nr, —2ni at z= (20+ 1)x. 
(b) +270 at z= 2nnx + =, —2ri at z= 2nn + _ 
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(c) Use the functional equation P(z)= T(z + v+ D/ae+1)...(2+ v): 
c e 


Qri at z= —n. 
(d) Qnt at z= ni. 


11. Write Ct _ cotmt , zcotmt, 


——$ cotrt is bounded on the squares 
t—z t ut — 2) 


nm 
cancel one another; hence 


¢ t 
tim ff FO at tim fe a= 0. 
n>o Jo, $—% n—>o Jo, Ub— 2) 


If we put together residues of opposite poles, the sum of ‘il residues 
converges and we obtain cotrz = =( : + a ae gt: +) 
wt 


(cf. Vol. I, p. 444). at 1 

1 
12. ~—=]—t+Ff~—~+...407 —1)j" ; 

if? + + + + ( "Tet 
Hence 

28 2 n 

log cre) SB goa ak +— +R, 

where 


ee ud 
=(—1)" f eae a 
a er 
If we take z = e!? and the straight line from 0 to e as path of integration, 
we have, for e + —I, 


1 @ a 
Bal=| [Poa a Is 2 [ima = ay 


where m denotes the minimum of 1+ ¢®for0 Si SL. Hence if 
z= e+ —1, R,, tends to 0. 


1 1). 
13. (a) f@)= 3 (aap Te 


now 
1 1 2v 1 
ea ae yaw 
| z | kal 


=(@—)*|]~ @—1" 


1 
and the series & @v is absolutely convergent for z > 0. 
v 
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(b) (1 — 2*-*)E(2) 


1 1 1 2 2 2 
jltatgetgatoc es # @ 
1 1 1 
=l-gtgo pt Se. 
= — @-1 fd) _ 
() tim (@ — YE) = 701) Bim, ga gay = 


where 
g(z) = 1 — 2%, 


MISCELLANEOUS EXAMPLES 


1. (a) If there were a linear relation ax + by + cz = 0, where e.g. 
a+ 0, then by scalar multiplication of this relation by x we should get 
axx + byx + czx = ax? = 0; hence a= 0, since x? +: 0. 

(b) The relation ax -+ by + cz = 0 is equivalent to the system of 
linear equations for a, b, ¢, 

ax, + by, + cz,= 0 
ax, + bys + cz, = 0 
ats + by, + cz = 0. 


These equations have the unique solution a = b= ¢= 0, unless the deter- 
minant vanishes. 

(c) The vector equation v= ax + by + cz corresponds to three 
ordinary linear equations for a, b, ¢ which certainly have a solution, since, 
by (0), the determinant is not zero. 


2. Take a co-ordinate system Ox, Oy, Oz. Then (a) reduces to the 
multiplication theorem for determinants; (b) reduces to the identity, 


24%! + ye! + Lgty § ByYy" + eo’ + ays" = | 72% x q’ Xa" 
Yrty + Yetta’ + Yara’ Ye + Yala’ + Yes" YoY Y2xYs" 
Ugly aay" Xe 2y'Hel 
+ | YsY1 Ys Yr a Ye yn'y2 | 


which is easily verified by splitting up the left-hand determinant into a 
sum of nine determinants; (c) may be verified by calculating the com- 
ponents of 2, y, 2; (d) is an immediate consequence of (c) and 1(6), since 
by (¢) 

[x(yz]] + [vlz~l]] + [efx] =. 


Finally, if x, , 2 are vectors lying respectively in the three concurrent 
straight lines, then the plane through x which is perpendicular to y and z 
passes through x and [yz], ie. its normal has the direction of [x[y2]]; 
the three normals lie in one plane, hence the planes pass through one line. 
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4. A rotation of Oz’y’ through the angle ~ leads to a new system 
Ox’’y’’. A direct passage from Oxy to Oxy” gives the desired result. 


5. (a) In the co-ordinate system Ox, Oy, Oz take the vectors (a, 1, Y1)» 
(22, Bor Yo), (a> Bs» Ys). Lf the determinant is orthogonal, the vectors will 
form a new orthogonal co-ordinate system Ox’, Oy’, Oz’. 

(6) The passage from the system Ox’, Oy’, Gz’ to the system Ox, Oy, Oz 
is given by the determinant 
% A Xs 
Br, Be Bs 
Y: Ye Ys 


which again must be orthogonal. 


6. Pass from Ox, Oy, Oz to Ox’, Oy’, Oz’ by the following three rota- 
tions: (1) Rotate Ox, Oy, Oz through the angle 9 about Oz, so as to form the 
new system Oz,, Oy,, Oz, (Oz = Oz). (2) Rotate Oz,, Oy,, Oz, through @ 
about Ox,, obtaining Ox,, Oya, Oz, (Ox, = Ors, Oz = Oz’). (3) Rotate 
Ox,, Oy2, Oz, through ~ about Oz,, obtaining Ox’, Oy’, Oz’. In each of 
these steps the change of variables is to be performed according to Ex. 3. 
Finally, eliminate the intermediate variables 2, ¥1, 2, ey Yo: 223 this is 
best done by multiplying, in the correct order, the three determinants 
corresponding to the above rotations. 


> 


7. Note that coszOx’ = cos@ cos) — sing sin) cos 0. 


8. If a is a unit vector in the direction of the normal to the plane and 
4 @ unit vector lying in the straight line, then 7 — 9 is the angle between 
a and 6. It follows that 
Aa + BB+ Cy 


Pr aren Rt B+ Oe + Fy 


9.¢2@=3,y=2,2=1. 


cos§ —sin® 0 cos % cos 8 cos Y | 
11. D=/sin6  cos@ 0| x {sine sin B siny | 
0 06 1| {sin(@—y) sin(y— a) sin(a— 8) 


the first factor is equal to unity. 


12. Adding the third and second column to the first, dropping the 
factor A + 2B, and subtracting the first row from the second and third 
row, we have D = (4 + 2B)(B— A)* 

= {(@ty+2) (+ P+ 2 — ay — a2 y)} 

13. In order to see that the determinant represents a linear function, 
subtract the first column from the other columns. By substituting s = —a@ 
or z= —bin A, we get A and B. 


14, As uv = 1, Leibnitz’s rule (cf. Vol. I, p. 202) gives 


uv + ur’ = 0 
wy + Qu’ + uv” == 0 
ues + Bu’ + 8u’v” = —u”, 
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These equations, considered as linear equations for v, , v”, have the 
determinant —D. If we solve the equations for » by the rule given on 
p- 25, we have 
0 u 0 
0 Ww u 
— ul” 8u’’ 3u’ 


= yy" . 'D, 


i.e. o'” = Dv/u® = D/ut. 


15. (b) Put z= logu; for z we then have the equation 2, = 0, i.e. 
z, does not depend on y. Let z, = 9(x); then 


z= ['owde + v0. 
If we put 


dx 
ef Pd Hay, XO = gly), 
then 
u == e* = f(x) X g{y)- 
17. Differentiate F(u,, vy) = 0 with respect to x and y. 


18. w is of the form 


w= 4 (5) ae a(Z). 


= | Qhe + dky + fh? + 212 
Vat @ + hy + Ay + H+ VAT ST 
S| Qha + 4hy + h? + 2% | 
S2|h?+ + 2ha + ky | 
S20? + 2+ QV (i + B)V (+ ¥) 
S2V (i + ) 1+ V2? + yh} 
if we assume that h? + kK? <1. Thus e.g. 
\fe@thytk)—fzy|<e 
2 Sh ee as 
for Vv (hi +) Ss V@tW 
20. Let «= at, y= bi; then 
# 
i = lim ab! = 
ae fe 9) sae : (a? +- 64¢7)® 
lim g(x, y) = lim ee 4 ep 
se 4-50 at? +. 6342 — at mi 
but if (, y) approaches the origin along the parabola y? = a, f(x, y) = § 
(2, y) = 1. 
21. Let C be given by the equations z = x(t), y = y(t), where 2(¢) and 


0, 
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y(t) have continuous derivatives. Let two points on C correspond to 
and t,. Applying the mean value theorem, we get 


t eg Soa, 
i= A Ve + xdt = (t, — t)\V/2(,)? + y(t) 
‘1 


d= V/[xlte) — a(t) + [y(te) — ye)P 
= (te — AV E(t) + Hr) 


where 1, Tz, ts lie between ¢, and #,; 
d—1= oft, — i), 


V ity) + H(t)? — Va(t2)? + ots)? > 0 as t, — t,->0. 


22. As the series has positive terms, it is sufficient to prove its con- 
vergence and find its sum for any order of the terms. Puta + b=; then 


a 7 fn a 2 y 
o- 2 3 C) ata - 2 5 8 (0 () 


since 


n=0 a=0 Lae 0 
eo 

- Bey (ee ae 
n=0Yy" & x 


since the relation 
n 


>> (2) az* = na(1 + 2)" 


a=0 


holds. (This may be proved by differentiating the identity 
n 
2 (") 24 = (1+ 2)"), 


a=0 
2 1 
Thus Pas Geir 
vn=O Y @& 
eee 
ey 
ey 


24. Sf dots denote differentiation with respect to the length of are a, 
we have 
k= V x. 


= xP L xe’, 
ge Be dt (axe): 
ay on." aa 5, (ay? 


Now 


hence 


all Agg/2 ( x’ ita 
(%)? 
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25. By definition 
at y?=1; 
hence, by differentiating, we get 
ala’ + yy’ = 0 2 ew ew ew we (8) 


If we put 
a'y”” - a’y’ =%, 
we have 
a= yy, y= 1s 
and hence 


8 2 = 3, 
Now as 2’ = a, 2” = 0, the osculating plane (cf. Ex. 1, p. 93) is given by 
—ay'(& — 2) + (n— y)ae” + (C—2y=0 «+ (0) 
it obviously contains the normal of the cylinder, given by 
(6 — xe + (n— yy =0,5=% 
By (a) the curvature (cf. Ex. 2A) is given by 
_@ityt+aetty)_ 
Gt yt + oF (I+ a} 
By (6) the binormal vector (cf. Ex. 7, p. 94) is given by 


————— oe ES a, Ee 
a(x”? + y’) + 7 ee + 7 aP{x/8 + y?) + =) 


or (Foe ee are) 
or (Fae eee V1+ a? aa): 


Since + is the length of the derivative of this vector with respect to the 
length of are (the element . which is (1 + a*)ds), we have 


a"? ees 
te a t= LaF 


26. Cf. Ex. 1, p. 93. The equation of the osculating plane is 
f+ i’ = Ef” cos0 + f’ sind) + a(f” sind — f’ cos6) + G 
the distance of the plane from the origin is 
(FAS UV ASA +S) 
which reduces to V/(1 + 1/A?) in the special case. 
27. Cf. Ex. 24. 
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28. (a) According to Ex. 3, p. 19, the plane is given by 


fats — x Roi — y Ch, — 2 
fat.5 — x $bt.? —y d,—2z|=0. 
fats — x $0t,? — y cts — z 


(6) By Ex. 1, p. 93, the osculating plane at the point ¢ is the limit of 
the plane through three points which tend towards the same point, and is 
therefore given by 

382 bly 3x 
OS gt be a 
At the point of intersection (x, y, z) of the osculating planes at t,, t, ts 
this equation must be satisfied for t= #, and t=, and t=. Hence 
ty, ty, tg are the three roots of the equation above. Therefore 


3z 
gt atht 


sd = tt, + tts + tots, 


Lager 
a ‘1°2"3" 


These expressions for 2, y, z satisfy the equation of the plane in (a). 


29. If b, c are kept fixed and a alone varies, we have s = $bc sin A, 
ds = $becosAdA. From a* = b? + c* — 2be cos.A, we have by differen- 
tiation ada = be sin AdA; hence 

a 
ds = aaa cos.A da = R cos Ada. 

30. Denote the components of the vector AP by 2, y, z Then 
ada + ydy + edz 
Vat yt a) 


31. Using a self-evident notation, we have 


AP = V(e+y2+2), d(AP) = —a.dP, 


: d d : 
P=A—PA.a,P=—PA.a— q (PA).@, or PA.d=— @ (PA).a—P. 
By Ex. 30, 

£ (PA) =—aP= —a(au + bv + cw) = —u — (abv — (acyw, 


Now 
PAad = aut (abjva + (acwa — au — bv — cw 
= [(@b)v + (ac)wla — vb — we. 


32. P= au-+ bv-+ cw, hence P= Gut bv+ dwt ati+ 564 cw. 
Introducing the expression for @ from the previous example and the similar 
expressions for 6 and é, we get the required expression for P. 
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33. If (x*/a2) + (y?/0?)=1 is the equation of the conic, then 
(a? + y2)? = 4(a%2? 4 by?) is the equation of the envelope. Note that 
if the conic is a rectangular hyperbola this envelope is an ordinary lemnis- 
cate (a? + y?)? = 4a%(a* — y?). 


35. If P describes the pedal curve I” of I, construct on OP as diameter 
a circle in the plane perpendicular to the plane of I’; the envelope is the 
surface generated by this variable circle. 


37. An ellipse. 
38. A plane touching the parabolas has an equation of the form 
—ce + cy-+ez=1 or —Cx+ cy— a= 1, 
The corresponding envelopes are 
(y + 2)* = 4@ and (y— z)* = 4a. 

39. The proof resembles that for n= 2 (Appendix to Chap. TIT, § 1, 
p. 204). A positively definite quadratic form Ea,,;%, can be brought by @ 
suitable transformation 7,= Lee (i=1,..., 2) with a non-vanishing de- 


k=1 
terminant into the form Day,2%,= yet yt +++ Yn > mMxey+...+ 
2,2), where m is a suitable positive constant. ‘For the applications it is 
important to remember that a necessary and sufficient condition that a 
form © = La,r,, shall be positively definite is that its principal first 
minors of order 1, 2,..., ”, as indicated below, 


(ay: Ae i ys SRE Gin 


shall all be positive. ® is negatively definite if —® is positively definite. 


40. Sketch the curve f = 0 and investigate the sign of f throughout 
the plane. 


41. If P; = (xy y:), % = PP» we have 
3 3 
Pf= ude, = Bry — y;)da —(% — %)dy, 


which is positively definite. 

42. At the point P,. Note that the function f = 7%) + 7. + 1s is con- 
tinuous in the whole plane, but not differentiable at the points P,, P,, Ps; 
where it has conical points (like the function 7 = VJ (a — &)? + (y — ys)" 
which geometrically represents a circular cone). Investigate the derivative 
of f at P, in all directions round this point. 


43. According to the first rule we have to compute d?f from (3), with 
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datyy oo 0 5 Wm, G22, ..., Oy, substituted from (1). Note that (1) implies 
that 

PQ, = LP pane, Ide, + Pua Pat eee tH Png, Pim = 0 (U=1,..-,m); 


if this is multiplied by 4, and added to (3) for all values of p, we have 
f= PF = XP, dxdaxy because d’x,,..., dx, drop out on account 
of the relations (2). 


44. For F = f + Ap (disregarding a positive factor) we get 


in 
BP = E da,dity with dp = dr, +... + dat, = 0. 
tk 


Eliminating dz,,, we have to show that the quadratic form 


1,n-1 


ljn-1 la-l 
—@F = (da +... + dx, — Sdada, = idx? + Udx,da, 
ik Gk 
is positively definite. 
46. The co-ordinate axes. 


47. y= 2{14+2'). The two branches of the curve forming the cusp 
at the origin lie on the same side of their common tangent. 


48. (a) If we put f= lz + my + nz, p= a? + y? + 2? — o?, F=f— ro, 
then the conditions for stationary values are 


1 = Apxz?—-1, m= Apy®—1, n= ApeP—1, - » (A) 
Multiplying these equations by x, y, 2 respectively and adding, we have 
le+-my+nz=drpe® . . . . . © (B) 
Calculating x, y, z from (A) and substituting in = 0, we get 
Ap = (14+ mt + n@)liget—2, 


Substitution of this expression for Xp in (B) gives the stationary value. 
(b) Cf. Ex. 43. Here we have 


BF = —Ap(p — 1)(a?—2da? +- yP—2dy? 4+. 2?—2dz®); 


as Ap > 0, this quadratic form is positively or negatively definite according 
asp Sl. 


49. Minimum for x = 1, y = 4, saddle point for «= —1, y= >. 


50. Let AB touch 2 at P. Let A’B’ be another tangent, and let the 
new point of contact be P’. Then if dp is the angle between AB and A’B’, 
and we neglect terms of the second order, the difference of the areas of 
A’B’C and ABC is 


AS = “2 (aps — BP’), 


For the triangle of least area AS = 0, that is, AP = BP, 
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51. Apply the transformation 


a’ —xcose—ysina, y= asina + y cosa. 


52. Let S denote the curve f(x, y) = C and S’ the curve 9(2, y) = CO". 
S and 9’ have a point of contact in (a, 6). In general, f(z, y) — Cis positive 
on one side of § and negative on the other side in some neighbourhood; 
similarly with 9(2, y) — CO’ and 8’. If e.g. f(a, b) is a maximum of f, then 
f(a, 4) —- 0 30 on SV’, ie. 8 is wholly on one side of S; then § is also on 
one side of 9’. That is, (x, y) — C’ has a constant sign on S and as it 
is equal to zero at (a, b), it has either a maximum or a minimum there. 


53. The equation of the generating tangent is 
zsin® + y cosé = a(6 sin® + cos 8 — 1). 


a 2b 1 __)a 
a BIO; 0 1— [aa cost a 


since 
tan9 


1 1 
er 0 = Fame oe tans) 


we have 


d Tv 1 
ZI0- 20-75) 
and. therefore 


f(z) = wrlogd+V1l—2#)—« log 2. 


56. According to p. 273, 


S= i i VEG — Farad 
7 i ie e* aa Fiar 


6, 
=[V2+ log(k+ v2) [ | 4f2d0 (cf. Vol. I, p. 215), 
% 
which is [V2 + log(1 + v2)] times the area of the projection 


06,5050 OSrsf'(6). 


57. As A— BR? = 8, A— 2 BR* = 4}, wo have A= 10, B= 43/R'. 
The attraction at an internal point is equal to the attraction of the total 
mass of the points inside of the sphere of radius r concentrated at the 
centre of the sphere. 
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58. By translation we can ensure that the triangle lies in the upper 
half-plane. Then its moment of inertia is equal to 
PLY 1» 2) + O(LeY2, Las) + P(wsYs> 2Yr)s 


where (2,41, 22) denotes the moment of inertia of the quadrilateral 
With vertices (x,, 0), (t1 Yr), (a Y2), (%, 0) multiplied by the sign of 
(%, — 2). Then show that 


PAY» Leo) = zhy (% — Ve)(Ys® + yr2ye + Yao? + Yy2°)- 


™ 
60. 2— -. 

2 
61. Introduce polar co-ordinates with the pole as origin. 


2 4 ly 
62.1= [ y — 4)ay f da = 12 — 16 log2. 
1 (8y—20)/(y-- 4) 


B 4 
63. (a) K= f dO if rlogrtdr. 
0 o 


a, p(x) 
0) K=f (f logtat + say) da, 
0 ‘Wo 
where 
9(z) = x tanB for 0 Sx Sa cos§; +/a? — 2* for acosB Sra, 


64. V= J, wh® tan®a. 


r 
F = = Sos i i 
‘or V if f zdo i vi (4 -) dc, where the integral is to be 
extended over the region 


h tan a(1 — sin 4/36 cos2/36) <r < h tana. 


Lar htana r 
That is, V = f do if (x = \rar 
0 h tana(1 — sin 4/39 cos 2/39) ne 


Qa Qa 
=i tanta. 38in 8/39 cos 4/3040 — A tan?a }sin*O cos?0d0; 
0 0 


if we substitute sin*@ = y in the first integral, it becomes 


: Ped) 1) 
y5I8 (1 — y)'8 dy = BOA, %) = 2 
i y é T@) 


= FTG) TQ) = vem 


where we have made use of the extension theorem for the gamma function 
(cf. pp. 335 and 337), 
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65. The generators are the lines of the surface given by «= const., 
or by y = const. Thus, as d8 = (1 + 2,4 + zf)'2dady, 
a] € 1 v7] gE 
I=—] d dx —_—,,=-—| 4 
i . i “Crate i, YOF yt E+ 
En 
(lt + wpe 


d 7 J tout + eens) aw da Tw 
2 ap cf feet 
& ae” f da Cosa si 9 1+ acose V/1— a? 


thus K(a) = ~ arcsina + const.; the constant is determined from the 
condition K(0) = 0. 


= —arc tan 


67. Introduce new variables u and v by the equations u— 2/y, 
v= y*/z. The area then becomes 


b* pd b* a 
i 2(% ¥) dudv = af u—2i5 du f v5 dy 
a 


c a(u, v) a c 


(cf. p. 253). 


68. Take a co-ordinate system Ox,, O2,, O23, and denote the position 


vector of a variable point on I by x. Then a= 4 f x X dx has the re- 
r 


quired properties, for @,,= 4 f (a, dat, — %,da,) is the area of the projection 
r 
of I’ on the plane O2,75. 


69. The motion takes place in a plane, since p is a central force proved 
for the case p = 1/r? on pp. 423-4). Hence 


t= " 
= 7? 
45 ¥ 
j= —"D. 


r 
It follows that 
xy — dy = const. = h, 
—2xt — yy 
r 


tt + jy = p= —fp. 


Hence 
d 
© ge y2) —. — 40, 
$5,@t w= —#p 
The distance of the tangent from the origin is 
_ lawl A, 
Vet e Vet yg. 


23 (8912) 


q 
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therefore 
d he dr 
ag de 
or 
ad 
a Z = =P. 


which proves the first statement. For the cardioid we have q = r?/V 2ar 


70. Let (x, ¥1),-- + (Xn Y,) be the attracting particles. Then the 
resultant force at a point (x, y) has the components 


xy=5 _ 7~ ke ; y= = Y¥-—W : 
v V{(a a 2,)? F _y — y)?} v V {(x =: x,)? + (y = y)*} 


If we introduce the complex quantities 2, = 2 + ty, -+ +5 %, = qt Yy 
z=2“2+ 4, Z=X+iY, we have 


where f(z) denotes the polynomial (z — z,)... (z — z,) and Z the complex 
quantity conjugate to z. The positions of equilibrium correspond to Z = 0, 
i.e. to the zeros of the polynomial f’(z), of which there are n — 1 at most. 

Positions of equilibrium in the particular case: (0, 0), (V(a? — 6), 0), 
(— V (a# — B*), 0). 


71. By definition 

= —Nx — Quy 
j= — My + ye. 
Or differentiating the two equations twice and combining them we get 
an equation involving x only, 


B+ (222 + 4u%)G + tz = 0, 
and a corresponding equation involving y only, 
V+ (222-4 4y%)9 + My = 0. 


Thus x and y are linear combinations of e###+V™+H"4 (of, Ex. 1, p. 444), 
or of cos(u+ VA7+ uw), cos(u—V + yal, sin(u + VY 22+ py), 
sin(u —V 2? + y?)t, with constant coefficients a, b, c, d, and a’, b’, ce’, d’. 
From (A) it follows that a’ = —c, & = —d, c’ =a, d’=b. Using:the 
initial conditions z(0) = y(0) = y(0) = 0, #(0) = u, we obtain the result 
given. 
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72. (b) The equation becomes of the form treated in (a) if we mul- 
tiply it by 2°. It has the particular solutions w= z* and »= 25; hence, 
by (a), a third solution is given by w= 1 ++ 2; the general solution is then 


A(l + a) + Ba? + C28, 
73. The curve satisfies the differential equation 


ret) = 


or in polar co-ordinates 7, 0, with 6 as independent variable, 


nr an 
Sa SEC 
cos 6 r sind 
that is, 
dlogr n 
6 eee 
whence 
n 
ao(t 43) 
ie 2 4 ago tsineye 
cos@ ~~ gos®+19§ 


ccf. Vol. I, pp. 214-5). 


74. According to p. 482, a solution of the first equation is of the form 
z= f(x + at) + g(z — at). 
On substituting this expression in the second equation we have 
f97 = 9, 


ie. either f = const. or g = const. Hence z = f(x + at) or z= f(z — al) 
is the most general solution of both equations. 


75. Put wu = (22 + y? + z*)"/2 and let K be of degree h. Then 


n—2 

AU = thgy + Uyy + Uzg = 2(m + 1)(2* + y* + 2) 2, 
OK OK 0K 

© og 8 Uy oe (cf. p. 109). 


ith 
Hence u = (22 + y?+ 22) 2 is a solution. 


76. (a) The value of the integral round the small circular detour 
tends to zero as the circle becomes smaller. If we put z= e’ on the 
23° (#912) 
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unit circle and z= x, z= iy respectively on the axes, Cauchy’s theorem 
gives 
ar 


1 r 
0 ={ (« + 2) "2" de + if? (ef + g-tymeion 7g 
0 x 
pf. , Wy, 
oa i (iy ++ 2) (ty)"~" dy 
0 ay: 
1 7 
=f (« + 1)" attae +%4.2™ ye cos™ Bet" d@ 

9 


we m) 


Eat ee 


by equating the imaginary parts of this equation, we get 


am , cos” 6 cosn6d0 = sin” — ™! a i (-y + 3)" yay 
= } vin) f° (1 — n)My(n—m—2)/2 dy 


= }sin 5m — m) B(m +1, a—*) (of. p. 337). 


(b) Use the relation 


sin? = =m) Ga) = Wi 
2 2 ra _u— im (cf. p. 335). 


2 


77. If % + 0 and if C’ is a contour in the region in which f is regular, 
and contains y but not 0, then, by p. 549, 


a oyfy) mt f(t) 


Serene At AE; 
dy" (y + ayett  Qridy, (E+ ah tt — yt} 
lf we put a = y = V x the latter integral becomes 
nt tf (t) 


ori Jy. (2 — wert 


If we then substitute # = +, the integral becomes 
ca fv) dx, 
4nid, (v— x)ntt 


where C is a contour containing x but not 0; this integral is equal to 


42 V2) 
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2 iy — e—v—iv\ /er—ty ea~atty 
78. |sinh = (° == 
| sinh(x + ty) | ; 3 


= } (cosh 2x — cos2y) 
= }(cosh2z — 1). 
Integrate along the boundary of a square with sides x= +7(n + 4) 


and y= +n + 4), where 7 is an integer. As n—> © the integral tends 
to zero; hence the sum of the residues tends to zero. 
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Curvature, 86, 125. 

— centre of, 87. 

— of surface, 168. 

— vector, 86, 93- 

Curves, families of, 124, 169. 

— in implicit form, 122-9. 

— in space, 86-8. 

— on surface, 162 et seq. 

— singular points of, 127-9, 209-13 

Curvilinear co-ordinates, 138. 

—— net, 135. 

Cusp, 128, 210. 

Cylindrical co-ordinates, 142. 


Definite quadratic form, 205. 
Density, 235-6. 

Derivative. See Differentiation. 
Determinants, 14, 18, 19-22. 
— differentiation of, 58, 59- 


679 


680 


Determinants, functional. See ¥acobian. 

— multiplication theorem for, 36. 

Differentiability, 60, 65. 

— of complex function, 530-3. 

Differential, 59 et seq., 67, 72-3, 193. 

Differential equations, existence theorem 
for, 451, 459-63. 

— Clairaut’s, 466. 

— Euler’s, 497 et seq., 502-4, 508, 513. 

— homogeneous, 431, 438, 442, 445-7; 
449. 

— linear, 438-42, 559-60. 

— — of first order, 429, 450-4. 

— — of second order, 442-3, 463. 

—— with constant coefficients, 449. 

— non-homogeneous, 431, 438, 442- 

— partial, 468, 481. 

— Riccati’s, 443-5. 

— systems of, 462-3. 

Differentiation and continuity, 54. 

— change of order of, 55, 57, 218. 

— in given direction, 62. 

— of complex function, 531. 

— of compound function, 71. 

— of determinants, 58, 59. 

— of integral, 219, 240. 

——with respect to parameter, 218- 
22, 312. 

— of power series, 526~8. 

— of vectors, 85. 

— space, 235. 

— to fractional order, 340-1. 

Dini’s theorem, 106. 

Dipole, 471. 

Direction cosines, 3, 6, 12 

Dirichlet’s discontinuous 

Dirichlet’s formula, 321. 

Discriminant, 172, 180, 210. 

Divergence of flow, 371, 389. 

—— of vector field, 91 et seg., 404. 

Domain. See Region. 

Double integral. See Integral. 

Double layer, potential of, 472~-7. 

Double limit, 46, 101. 

Double point, 210. 

Du Bois Reymond’s proof of Euler’s 
equation, 499. 


4A. 
factor, 320. 


Ellipse, 127. 

Ellipsoid, 131, 264-5, 285-6. 

Elliptic integral, 221. 

Energy, kinetic, 280, 416, 511. 

— law of conservation of, 281, 416, 512. 

— potential, 415, 511. 

Envelopes of families of curves, 171-4. 

— of families of surfaces, 179-81. 

Epicycloid, 179. 

Equilibrium, 416 et seq. 

Equipotential surfaces, 470. 

Errors, calculus of, 68. 

Euler’s differential equation, 497 et seq., 
, 502-4, 508, 513. 

— integrals, 323-38. 

— multiplier, 457-9. 

~— relation for homogeneous functions, 

Tog. 
— representation of motion of fluid, 212. 
Exponential function, 534, 544. 
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Extremals, sor. 

Extreme values, 183-6. 

— — sufficient conditions for, 207. 

—-—with subsidiary conditions, 183- 
199. 


Falling body, 418. 
Families of curves, 124, 169. 

— — differential equation of, 454-5. 
—— of surfaces, 130, 170. : 
Fermat’s principle of least time, 495. 
Field of vectors, 82. 

Flow of fluid, 370, 384, 388, 396. 
Focal co-ordinates, 158, 392. 

Folium of Descartes, 117, 132. 
Force, field of, 372, 384. 

— lines of, 384, 470. 

Forces, space and surface, 391. 
Fourier integral, 318-23. 

Fresnel’s integrals, 317. 

Function, algebraic, 44, 118. 

— analytic, 532, 536. 

— complex, 531. 

— compound, 69 et seq. 

—- differentiable, 60. 

— homogeneous, 108-10, 

— implicit, 111-21. 

— many-valued, 563. 

— of function, 494. 

— of Berea variables, 39) 43. 

= rational, 44, 45, 556. 
Functional determinant. 


See Jacobian. 


Gamma function, 323-38, 545, 565-6. 

— — extension theorem for, 335+ 

— -— infinite product for, 333. 

ae fundamental quantities, 
168. 

Gauss’s product for gamma function, 


162, 


330. 
Gauss’s theorem, 360, 364, 370, 384 et 
$€q., 401, 402~4. 
Geodesics, 493, 517-8. 
Geometric series, 524. 
Gradient, 89, 92, 124, 131. 
Gravitation, 282-5. 
Gravitational fields, 83, 90, 351, 407, 413. 
Green’s theorems, 366, 390. 
Guldin’s rule, 274, 294 et seg. 


Hamilton’s principle, 510-2. 

Heine-Borel theorem, 99. 

Hélder’s inequality, 201. 

Homogeneous function, 108-10. 

— differential equation, 431, 438, 442, 
445-7, 449. 

Hyperboloid, 161, 168. 

Implicit functions, existence and con- 
tinuity of, 114, 117. 

Inertia, moment of, 278-80, 286, 

Inflection, point of, 125. 

Inner product, 7, 85. 

Integral as function of parameter, 216- 
2X. 

— convergence of, 257, 259, 260, 263. 

— curves, 451. 

—- differentiation of, 219, 235, 240. 


INDEX 


Integral, evaluation of definite, 554-6. 

— improper, 256~64, 307-13- 

— — convergent, 257. 

— line, 343 et seq. 

— mean value theorem, 232. 

~— multiple, 215 et seq. 

—-—-reduction to repeated integrals 
237-8, 241-6, 266-7. 

— of product of functions, 228. 

— over surface, 300-7, 374-84. 

——in more 3 dimensions, 301 
et seq. 2 

— transformation of, 247-54, 368, 373- 

Integrating factor, 457-9. 

Integration, change of order of, 239, 
310-2. 

— of analytic functions, 537-41. 

— of power series, 526-8. 

— of rational functions, 556~7. 

— to fractional order, 339-40. 

Intensity of flow, 371, 389. 

Inverse functions, derivatives of, 142-5. 

Inversion, 135, 153, 157, 168. 

— in space, 159. 

Irrotational, 372, 397+ 

Isoclines, 454- 

Isolated point, 210. 

Isoperimetrical problem, 
518-20. 


214, 493. 


Jacobian, 143-4, 147, 181, 154, 156-7, 
248, 253, 367-8, 377- 


Kepler’s laws, 422 et seq. 


Lagrange’s differential equation, 466. 

— dynamical equations, 512. 

— identity, 19. 

— multiplier, 190-9, 516-8. 

— representation of motion of fluid, 212. 

Laplace’s equation, 76, 93, 397, 472- 

— — boundary value problem, 478. 

—- — from variation problem, 515. 

— — in polar co-ordinates, 76, 369, 391. 

Lemniscate, 116, 128, 132, 710. 

Length of arc, 86, 162, 

Level lines, 90. 

— surfaces, 90, 131, 470. 

Limit of double sequence, 46, ror. 

Line element, 163, 273. 

Line integrals, 343 et seg. 

— — main theorem on, 352, 358, 397- 

Linear differential equation. See L.f- 
ferential equations. 

— equation, 23-6. : 

Linearly dependent functions, 439~ .0. 

Lines of force, 384, 479. 

Lissajous figures, 422. 

Logarithm, 341-4, 564, 567. 


Many-valued functions, 563- 
Mappings, 133 et seg. 

— of surfaces, 161-2. 

Mass, 235-6, 276. 

— centre of, 12, 38, 277. 

— moment of, 276-7. 
Maxima. See Extreme values. 
Maxwell’s equations, 485-8. 
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Mean value theorem, 80. 

—— for potential functions, 477. 
— — of integral calculus, 232. 
Minima. See Extreme values. 
Minimal surfaces, 515. 

Mobius band, 379. 

Moment of inertia, 278-80, 286. 
— of mass, 276~7. 

Momental ellipsoid, 286. 
Multiple integrals. See Integral. 
Multiple point, 128. 

Multiplier, Euler’s, 457-9. 

— Lagrange’s, 190-9, 516-8. 


Nabla, 92. 

Neighbourhood, 42, 99. 

Newton’s fundamental equations of 
mechanics, 413. 

— law of attraction, 413. 

Node, 210. 

Normal, 124, 130, 163-4. 

— vector, 86 


o, O notation, 48. 

Open regions, 42. 

Order of vanishing, 47-9, 551- 
Orientation of co-ordinate axes, 2. 
— of surfaces, 375-81. 
Orthogonal curves, 126. 

— trajectories, 456. 

Oscillations, small, 419. 
Osculating plane, 93, 94, 518. 
Outer product, 13 et seq., 85. 


Parabolas, confoccl, 126, 137, 139+ 

Parabolic co-ordinates, 139. 

Parametric curves, 165. 

Partial derivatives, 51 ef seq. 

Partial differential equations, 468, 481. 

Pendulum, 280-2. 

Planes, angle between, 11 

— equation of, 8. 9. 

Planetary motion, 422 ¢f sez. 

Planimeter, 297-8. 

Plateau’s problem, 515. 

Poisson’s integral, 479. 

Polar co-ordinates, 138, 143-4. 

— — derivatives in, 75-6. 

—— in space, 141, 254. 

— — integrals in, 254. 

— — Laplace’s equation in, 76, 369, 391 

— — volume in, 267. 

Poles, 469, 552-3- 

Polynomials, 43, 45+ 

— Hermite, 82. 

— multiple integrals of, 228. 

Potential energy, 415, 51I. 

— function, 91, 550. 

— of double layer, 472-7. 

— of force, 283, 350. 

— of mass distribution, 469 ef seq. 

— of spherical surface, 284-5. 

— — mean value theorem for, 477. 

Power function, 544-5. 

Power series, 525 et seq., 547-9. 553- 

Pressure, 392- 

Primitive transformations, 31, 32 149 
et seq. 
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Quadratic forms, 204-7. 


Rational functions, 44, 45, 556. 

Real numbers, 569 et seq. 

Regions, 41 et seq. 

Regular function. See Analytic fune- 
tion, 

Residues, 552. 

— theorem of, 553, 556-7. 

Revolution, area of surface of, 274. 

— potential of solid of, 286. 

— volume of solid of, 266-7. 

Riccati’s differential equation, 443-5. 

Riemann’s zeta-function, 545, 568 

Rotation, 5, 12, 19, 38, 83, 88. 

— vector, 92. 


Saddle point, 185, 207, 211. 
Scalar, 84, 88. 

Scalar product, 7, 85. 

Schuler’s pendulum, 282. 

Screw, orientation of, 2. 

— surface, 276. 

Separation of variables, 431. 

Sets of points, 96 et seq. 

Singular points of analytic functions, 


552. 
— — of curves, 127-9, 209-11. 
— — of surfaces, 211-2, 
Sinks, 370. 
Solid angle, 408, 474. 
Source-free vector field, 404. 
Sources, 370, 371, 469. 
Space di erentiation, 235. 
Sphere, area of, 270-1, 273. 
— centre of mass of, 278. 
— line element on, 168, 
— moment of inertia of, 280. 
— n-dimensional, 302-4. 
— parametric representation of, 160. 
— potential of, 283, 284-5. 
— tangent plane to, 131. 
— volume of, 267. 
Spheroid, 275. 
Stationary character of integral, 497 


Steiner’s theorem, 2'79-80. 
Stereographic projection, 160, 167. 
Stokes’s theorem, 365, 393, 402-4. 
Straight lines, equation of, 8, 9. 
—— shortest distance between, r9. 
Strophoid, 177, 210. 

Superposition, principle of, 438, 480. 
Surfaces, angle between, 130. 
— area of, 268~74, 300-7, 
— element of, 270. 
— family of, 130, 170. 
— Gauss’s fundamental quantities of, 

162, 168. 

— in implicit form, 129-31. 
— integration over, 300-7, 374-84, 
— normals of, 130, 163, 164. 
-— parametric representation of, 159. 
-— singular points of, 211-2. 

~ tangent plane of, 64, 130. 
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Tangent line, 65, 124. 

— plane, 64 et seq, 130. 

—— of quadric, 77. 

—— vector, 86, 93. 

Tangential equation, 213. 

— function, 213. 

Taylor’s series, 528, 549. 

—— theorem, 80. 

Tetrahedron, centre of mass of, 12. 
— volume of, 18, 27. 

Torsion, 94. 

Torus, 165-6, 2'74. 

Total differential, 66, 351. 
Trajectories, 456, 465. 
Transcendental function, 119. 
Transformations, affine, 27-33, 74, 78, 


; 133. 

—- combination of, 146 et seq. 

-~ determinant of, 28, 33 ef seq., 147. 
~~ general, 133 et seq. 

~~ of co-ordinates, 5. 

— of derivatives, 75~6. 

—- of integrals, 247-54, 368, 373. 

— primitive, 31, 32, 149 et seq. 
Tube-surface, 179, 182, 275, 285, 298. 


Rpderemines coefficients, method of, 

463-4. 

Uniform continuity, 97. 

Uniform convergence, Dini’s theorem 
on, 106. 

-~~-— of double sequence, 104 et seg. 

— —of improper integrals, 308 et seq. 


Nsiatiog of function, 496, 508. 

-~ of parameters, 430, 445. 

Vector field, 82. ‘ 

Vectors, 3 et seg. 

~~ components of, 5. 

— curl of, 92, 393, 404. 

— divergence of, 91 et seg., 404. 

— families of, 85. 

— field of, 82. 

— scalar product of, 7, 85. 

— vector product of, 13 et seqg., 85. 

Velocity field, 371. 

— vector, 87. 

Vibrations, forced, 448-9. 

Volume, 223 et seq., 266, 387. 

— in polar co-ordinates, 267. 

— of n-dimensional sphere, 300-7, 

~~ of region bounded by planes, 294 
et seq. 

— of solid of revolution 266-7, 

— of tetrahedron, 18, 27. 

—— orientation of, 380. 


Wave equation, 481-5. 

Waves, 484-5. 

Weierstrass’s infinite product for gamma 
function, 333. 

Work, 343, 350, 373, 414. 

Wronskian, 440, 442. 


Zeros of analytic function, 551, 553, 559. 
Zeta function, 545, 568. 
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